================= CRYOSPARCW ======= 2021-07-20 18:36:51.746393 ========= Project P17 Job J762 Master jptitan Port 39002 =========================================================================== ========= monitor process now starting main process MAINPROCESS PID 979414 MAIN PID 979414 refine.newrun cryosparc_compute.jobs.jobregister ========= monitor process now waiting for main process ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat *************************************************************** Running job J762 of type nonuniform_refine_new Running job on hostname %s jptitan Allocated Resources : {'fixed': {'SSD': True}, 'hostname': 'jptitan', 'lane': 'default', 'lane_type': 'default', 'license': True, 'licenses_acquired': 1, 'slots': {'CPU': [0, 1, 2, 3], 'GPU': [0], 'RAM': [0, 1, 2]}, 'target': {'cache_path': '/scratch', 'cache_quota_mb': None, 'cache_reserve_mb': 10000, 'desc': None, 'gpus': [{'id': 0, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 1, 'mem': 11554717696, 'name': 'GeForce RTX 2080 Ti'}, {'id': 2, 'mem': 11554324480, 'name': 'GeForce RTX 2080 Ti'}], 'hostname': 'jptitan', 'lane': 'default', 'monitor_port': None, 'name': 'jptitan', 'resource_fixed': {'SSD': True}, 'resource_slots': {'CPU': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63], 'GPU': [0, 1, 2], 'RAM': [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15]}, 'ssh_str': 'jparmache@jptitan', 'title': 'Worker node jptitan', 'type': 'node', 'worker_bin_path': '/data/software/cryosparc/cryosparc2_worker/bin/cryosparcw'}} HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256========= sending heartbeat grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (342, 1, 2007, 81) 218 block size 256 grid size (342, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (342, 1, 672, 44) 894 block size 256 grid size (342, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (342, 1, 224, 24) 2362 block size 256 grid size (342, 14, 1) global compute_resid_pow with (342, 1, 80, 12) 2362 block size 256 grid size (342, 5, 1) global compute_resid_pow with (342, 1, 32, 8) 2362 block size 256 grid size (342, 2, 1) global compute_resid_pow with (342, 1, 16, 4) 2362 block size 256 grid size (342, 1, 1) global compute_resid_pow with (342, 1, 8, 4) 2362 block size 128 grid size (342, 8, 1) global compute_resid_pow with (342, 1, 19, 21) 2362 block size 256 grid size (342, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256========= sending heartbeat grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362========= sending heartbeat block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (343, 1, 2007, 81) 218 block size 256 grid size (343, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (343, 1, 672, 44) 894 block size 256 grid size (343, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (343, 1, 224, 24) 2362 block size 256 grid size (343, 14, 1) global compute_resid_pow with (343, 1, 80, 12) 2362 block size 256 grid size (343, 5, 1) global compute_resid_pow with (343, 1, 32, 8) 2362 block size 256 grid size (343, 2, 1) global compute_resid_pow with (343, 1, 16, 4) 2362 block size 256 grid size (343, 1, 1) global compute_resid_pow with (343, 1, 8, 4) 2362 block size 128 grid size (343, 8, 1) global compute_resid_pow with (343, 1, 19, 21) 2362 block size 256 grid size (343, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256========= sending heartbeat grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (342, 1, 2007, 81) 218 block size 256 grid size (342, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (342, 1, 672, 44) 894 block size 256 grid size (342, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (342, 1, 224, 24) 2362 block size 256 grid size (342, 14, 1) global compute_resid_pow with (342, 1, 80, 12) 2362 block size 256 grid size (342, 5, 1) global compute_resid_pow with (342, 1, 32, 8) 2362 block size 256 grid size (342, 2, 1) global compute_resid_pow with (342, 1, 16, 4) 2362 block size 256 grid size (342, 1, 1) global compute_resid_pow with (342, 1, 8, 4) 2362 block size 128 grid size (342, 8, 1) global compute_resid_pow with (342, 1, 19, 21) 2362 block size 256 grid size (342, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256========= sending heartbeat grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362========= sending heartbeat ========= sending heartbeat block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 2362 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 2362 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 2362 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 2362 block size 128 grid size (500, 8, 1) global compute_resid_pow with (343, 1, 2007, 81) 218 block size 256 grid size (343, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 2362 block size 256 grid size (500, 2, 1) global compute_resid_pow with (343, 1, 672, 44) 894 block size 256 grid size (343, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (343, 1, 224, 24) 2362 block size 256 grid size (343, 14, 1) global compute_resid_pow with (343, 1, 80, 12) 2362 block size 256 grid size (343, 5, 1) global compute_resid_pow with (343, 1, 32, 8) 2362 block size 256 grid size (343, 2, 1) global compute_resid_pow with (343, 1, 16, 4) 2362 block size 256 grid size (343, 1, 1) global compute_resid_pow with (343, 1, 8, 4) 2362 block size 128 grid size (343, 8, 1) global compute_resid_pow with (343, 1, 19, 21) 2362 block size 256 grid size (343, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 77.877 radwn. 0.5 at 46.311 radwn. Took 3.422s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 85.285 radwn. 0.5 at 71.648 radwn. Took 3.228s. FSC Loose Mask... ========= sending heartbeat 0.143 at 95.636 radwn. 0.5 at 79.222 radwn. Took 12.241s. FSC Tight Mask... ========= sending heartbeat 0.143 at 98.482 radwn. 0.5 at 85.875 radwn. Took 10.014s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (428, 1, 2007, 81) 218 block size 256 grid size (428, 126, 1) global compute_resid_pow with (428, 1, 672, 44) 894 block size 256 grid size (428, 42, 1) global compute_resid_pow with (428, 1, 224, 24) 3604 block size 256 grid size (428, 14, 1) global compute_resid_pow with (428, 1, 80, 12) 14456 block size 256 grid size (428, 5, 1) global compute_resid_pow with (428, 1, 32, 8) 15242 block size 256 grid size (428, 2, 1) global compute_resid_pow with (428, 1, 16, 4) 15242 block size 256 grid size (428, 1, 1) global compute_resid_pow with (428, 1, 8, 4) 15242 block size 128 grid size (428, 8, 1) global compute_resid_pow with (428, 1, 19, 21) 15242 block size 256 grid size (428, 2, 1) global compute_resid_pow with (428, 1, 19, 21) 15242 block size 256 grid size (428, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (429, 1, 2007, 81) 218 block size 256 grid size (429, 126, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (429, 1, 672, 44) 894 block size 256 grid size (429, 42, 1) global compute_resid_pow with (429, 1, 224, 24) 3604 block size 256 grid size (429, 14, 1) global compute_resid_pow with (429, 1, 80, 12) 14456 block size 256 grid size (429, 5, 1) global compute_resid_pow with (429, 1, 32, 8) 15242 block size 256 grid size (429, 2, 1) global compute_resid_pow with (429, 1, 16, 4) 15242 block size 256 grid size (429, 1, 1) global compute_resid_pow with (429, 1, 8, 4) 15242 block size 128 grid size (429, 8, 1) global compute_resid_pow with (429, 1, 19, 21) 15242 block size 256 grid size (429, 2, 1) global compute_resid_pow with (429, 1, 19, 21) 15242 block size 256 grid size (429, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (428, 1, 2007, 81) 218 block size 256 grid size (428, 126, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (428, 1, 672, 44) 894 block size 256 grid size (428, 42, 1) global compute_resid_pow with (428, 1, 224, 24) 3604 block size 256 grid size (428, 14, 1) global compute_resid_pow with (428, 1, 80, 12) 14456 block size 256 grid size (428, 5, 1) global compute_resid_pow with (428, 1, 32, 8) 15242 block size 256 grid size (428, 2, 1) global compute_resid_pow with (428, 1, 16, 4) 15242 block size 256 grid size (428, 1, 1) global compute_resid_pow with (428, 1, 8, 4) 15242 block size 128 grid size (428, 8, 1) global compute_resid_pow with (428, 1, 19, 21) 15242 block size 256 grid size (428, 2, 1) global compute_resid_pow with (428, 1, 19, 21) 15242 block size 256 grid size (428, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 15242 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 15242 block size 128 grid size (500, 8, 1) global compute_resid_pow with (429, 1, 2007, 81) 218 block size 256 grid size (429, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 15242 block size 256 grid size (500, 2, 1) global compute_resid_pow with (429, 1, 672, 44) 894 block size 256 grid size (429, 42, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (429, 1, 224, 24) 3604 block size 256 grid size (429, 14, 1) global compute_resid_pow with (429, 1, 80, 12) 14456 block size 256 grid size (429, 5, 1) global compute_resid_pow with (429, 1, 32, 8) 15242 block size 256 grid size (429, 2, 1) global compute_resid_pow with (429, 1, 16, 4) 15242 block size 256 grid size (429, 1, 1) global compute_resid_pow with (429, 1, 8, 4) 15242 block size 128 grid size (429, 8, 1) global compute_resid_pow with (429, 1, 19, 21) 15242 block size 256 grid size (429, 2, 1) global compute_resid_pow with (429, 1, 19, 21) 15242 block size 256 grid size (429, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 93.904 radwn. 0.5 at 75.805 radwn. Took 2.660s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 97.195 radwn. 0.5 at 80.690 radwn. Took 3.055s. FSC Loose Mask... ========= sending heartbeat 0.143 at 101.325 radwn. 0.5 at 92.601 radwn. Took 12.240s. FSC Tight Mask... ========= sending heartbeat 0.143 at 107.382 radwn. 0.5 at 97.617 radwn. Took 10.322s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (2, 1, 2007, 81) 218 block size 256 grid size (2, 126, 1) global compute_resid_pow with (2, 1, 672, 44) 894 block size 256 grid size (2, 42, 1) global compute_resid_pow with (2, 1, 224, 24) 3604 block size 256 grid size (2, 14, 1) global compute_resid_pow with (2, 1, 80, 12) 14456 block size 256 grid size (2, 5, 1) global compute_resid_pow with (2, 1, 32, 8) 18120 block size 256 grid size (2, 2, 1) global compute_resid_pow with (2, 1, 16, 4) 18120 block size 256 grid size (2, 1, 1) global compute_resid_pow with (2, 1, 8, 4) 18120 block size 128 grid size (2, 8, 1) global compute_resid_pow with (2, 1, 19, 21) 18120 block size 256 grid size (2, 2, 1) global compute_resid_pow with (2, 1, 19, 21) 18120 block size 256 grid size (2, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) ========= sending heartbeat 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12)========= sending heartbeat 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (3, 1, 2007, 81) 218 block size 256 grid size (3, 126, 1) global compute_resid_pow with (3, 1, 672, 44) 894 block size 256 grid size (3, 42, 1) global compute_resid_pow with (3, 1, 224, 24) 3604 block size 256 grid size (3, 14, 1) global compute_resid_pow with (3, 1, 80, 12) 14456 block size 256 grid size (3, 5, 1) global compute_resid_pow with (3, 1, 32, 8) 18120 block size 256 grid size (3, 2, 1) global compute_resid_pow with (3, 1, 16, 4) 18120 block size 256 grid size (3, 1, 1) global compute_resid_pow with (3, 1, 8, 4) 18120 block size 128 grid size (3, 8, 1) global compute_resid_pow with (3, 1, 19, 21) 18120 block size 256 grid size (3, 2, 1) global compute_resid_pow with (3, 1, 19, 21) 18120 block size 256 grid size (3, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (2, 1, 2007, 81) 218 block size 256 grid size (2, 126, 1) global compute_resid_pow with (2, 1, 672, 44) 894 block size 256 grid size (2, 42, 1) global compute_resid_pow with (2, 1, 224, 24) 3604 block size 256 grid size (2, 14, 1) global compute_resid_pow with (2, 1, 80, 12) 14456 block size 256 grid size (2, 5, 1) global compute_resid_pow with (2, 1, 32, 8) 18120 block size 256 grid size (2, 2, 1) global compute_resid_pow with (2, 1, 16, 4) 18120 block size 256 grid size (2, 1, 1) global compute_resid_pow with (2, 1, 8, 4) 18120 block size 128 grid size (2, 8, 1) global compute_resid_pow with (2, 1, 19, 21) 18120 block size 256 grid size (2, 2, 1) global compute_resid_pow with (2, 1, 19, 21) 18120 block size 256 grid size (2, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat ========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (3, 1, 2007, 81) 218 block size 256 grid size (3, 126, 1) global compute_resid_pow with (3, 1, 672, 44) 894 block size 256 grid size (3, 42, 1) global compute_resid_pow with (3, 1, 224, 24) 3604 block size 256 grid size (3, 14, 1) global compute_resid_pow with (3, 1, 80, 12) 14456 block size 256 grid size (3, 5, 1) global compute_resid_pow with (3, 1, 32, 8) 18120 block size 256 grid size (3, 2, 1) global compute_resid_pow with (3, 1, 16, 4) 18120 block size 256 grid size (3, 1, 1) global compute_resid_pow with (3, 1, 8, 4) 18120 block size 128 grid size (3, 8, 1) global compute_resid_pow with (3, 1, 19, 21) 18120 block size 256 grid size (3, 2, 1) global compute_resid_pow with (3, 1, 19, 21) 18120 block size 256 grid size (3, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18120 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18120 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18120 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 96.405 radwn. 0.5 at 77.620 radwn. Took 3.606s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 99.409 radwn. 0.5 at 86.052 radwn. Took 3.273s. FSC Loose Mask... ========= sending heartbeat 0.143 at 104.052 radwn. 0.5 at 96.269 radwn. Took 13.409s. FSC Tight Mask... ========= sending heartbeat 0.143 at 109.011 radwn. 0.5 at 100.388 radwn. Took 9.917s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat ========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 18656 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 18656 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 18656 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 18656 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 18656 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat ========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 18656 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 18656 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 18656 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 18656 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 18656 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 18656 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 18656 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 18656 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 18656 block size 256 grid size========= sending heartbeat (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 18656 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat ========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 18656 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 18656 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 18656 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 18656 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 18656 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 18656 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 18656 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 18656 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 101.374 radwn. 0.5 at 93.166 radwn. Took 3.203s. FSC Spherical Mask... 0.143 at 104.325 radwn. 0.5 at 96.847 radwn. Took 2.862s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 110.008 radwn. 0.5 at 101.363 radwn. Took 10.474s. FSC Tight Mask... ========= sending heartbeat 0.143 at 116.829 radwn. 0.5 at 106.312 radwn. Took 9.840s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 116.498 radwn. 0.5 at 105.931 radwn. Took 20.500s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 21320 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 21320 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 21320 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 21320 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 21320 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 21320 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 21320 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 21320 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 21320 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 21320 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 21320 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 21320 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 21320 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 21320 block size 256 grid size========= sending heartbeat (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 21320 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat ========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 21320 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 21320 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 21320 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 21320 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 21320 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21320 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21320 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21320 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 101.701 radwn. 0.5 at 93.748 radwn. Took 2.938s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 104.598 radwn. 0.5 at 97.168 radwn. Took 3.290s. FSC Loose Mask... ========= sending heartbeat 0.143 at 111.948 radwn. 0.5 at 101.692 radwn. Took 13.538s. FSC Tight Mask... ========= sending heartbeat 0.143 at 117.665 radwn. 0.5 at 106.751 radwn. Took 11.312s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 117.103 radwn. 0.5 at 106.463 radwn. Took 19.889s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 21542 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 21542 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 21542 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 21542 block size 256 grid size (34, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (34, 1, 19, 21) 21542 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 21542 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 21542 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 21542 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 21542 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 21542 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 21542 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 21542 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 21542 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 21542 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 21542 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat ========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 21542 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 21542 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 21542 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 21542 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 21542 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 21542 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 21542 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 21542 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 103.360 radwn. 0.5 at 95.861 radwn. Took 2.731s. FSC Spherical Mask... 0.143 at 108.249 radwn. 0.5 at 98.826 radwn. Took 3.151s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 116.961 radwn. 0.5 at 103.458 radwn. Took 11.487s. FSC Tight Mask... ========= sending heartbeat 0.143 at 124.014 radwn. 0.5 at 110.396 radwn. Took 10.728s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 123.689 radwn. 0.5 at 110.238 radwn. Took 21.330s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: divide by zero encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 24020 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 24020 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 24020 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 24020 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 24020 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 24020 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 24020 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 24020 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 24020 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 24020 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 24020 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 24020 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 24020 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 24020 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 24020 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat ========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 24020 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 24020 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 24020 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 24020 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 24020 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 24020 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 24020 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 24020 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 104.161 radwn. 0.5 at 96.226 radwn. Took 2.337s. FSC Spherical Mask... 0.143 at 109.318 radwn. 0.5 at 99.295 radwn. Took 3.121s. FSC Loose Mask... ========= sending heartbeat ========= sending heartbeat 0.143 at 119.580 radwn. 0.5 at 104.482 radwn. Took 10.786s. FSC Tight Mask... ========= sending heartbeat 0.143 at 127.039 radwn. 0.5 at 111.701 radwn. Took 10.797s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 126.923 radwn. 0.5 at 111.530 radwn. Took 21.221s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 25306 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 25306 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 25306 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 25306 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 25306 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size========= sending heartbeat 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 25306 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 25306 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 25306 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 25306 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 25306 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size ========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 25306 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 25306 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 25306 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 25306 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 25306 block size 256 grid size (34, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8)========= sending heartbeat 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size========= sending heartbeat 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size ========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44)========= sending heartbeat 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size========= sending heartbeat 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size========= sending heartbeat (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 25306 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 25306 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 25306 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 25306 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 25306 block size 256 grid size (34, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 25306 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 25306 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 25306 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... ========= sending heartbeat 0.143 at 104.967 radwn. 0.5 at 96.460 radwn. Took 2.286s. FSC Spherical Mask... 0.143 at 109.722 radwn. 0.5 at 99.610 radwn. Took 2.807s. FSC Loose Mask... ========= sending heartbeat 0.143 at 120.442 radwn. 0.5 at 105.135 radwn. Took 10.334s. FSC Tight Mask... ========= sending heartbeat 0.143 at 128.655 radwn. 0.5 at 112.008 radwn. Took 10.044s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 128.815 radwn. 0.5 at 111.767 radwn. Took 20.316s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size ========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 26062 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26062 block size 256 grid size (34, 1, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (34, 1, 8, 4) 26062 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 26062 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 26062 block size 256 grid size (34, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 26062 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 26062 block size 256 grid size (35, 1, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (35, 1, 8, 4) 26062 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 26062 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 26062 block size 256 grid size (35, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4)========= sending heartbeat 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 26062 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26062 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 26062 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 26062 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 26062 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat ========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 26062 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26062 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 26062 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 26062 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 26062 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26062 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26062 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26062 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... 0.143 at 105.568 radwn. 0.5 at 96.590 radwn. Took 2.265s. FSC Spherical Mask... 0.143 at 109.838 radwn. 0.5 at 99.761 radwn. Took 2.802s. FSC Loose Mask... ========= sending heartbeat 0.143 at 120.526 radwn. 0.5 at 105.506 radwn. Took 10.169s. FSC Tight Mask... ========= sending heartbeat 0.143 at 129.618 radwn. 0.5 at 112.048 radwn. Took 9.939s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat 0.143 at 129.708 radwn. 0.5 at 111.909 radwn. Took 22.436s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in cufft.Plan.__del__: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 min: -1.000000 max: 1.000000 exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size========= sending heartbeat (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (34, 1, 32, 8) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26424 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 26424 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) exception in cufft.Plan.__del__: global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (35, 1, 2007, 81) 218 block size 256 grid size (35, 126, 1) global compute_resid_pow with (35, 1, 672, 44) 894 block size 256 grid size (35, 42, 1) global compute_resid_pow with (35, 1, 224, 24) 3604 block size 256 grid size (35, 14, 1) global compute_resid_pow with (35, 1, 80, 12) 14456 block size 256 grid size (35, 5, 1) global compute_resid_pow with (35, 1, 32, 8) 26424 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 16, 4) 26424 block size 256 grid size (35, 1, 1) global compute_resid_pow with (35, 1, 8, 4) 26424 block size 128 grid size (35, 8, 1) global compute_resid_pow with (35, 1, 19, 21) 26424 block size 256 grid size (35, 2, 1) global compute_resid_pow with (35, 1, 19, 21) 26424 block size 256 grid size (35, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81)========= sending heartbeat 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size========= sending heartbeat 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26424 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 26424 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' exception in force_free_cufft_plan: 'NoneType' object has no attribute 'handle' global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size========= sending heartbeat (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global========= sending heartbeat compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21)========= sending heartbeat 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size========= sending heartbeat 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size========= sending heartbeat (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1)========= sending heartbeat global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4)========= sending heartbeat 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size========= sending heartbeat 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size========= sending heartbeat (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) ========= sending heartbeat global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with========= sending heartbeat (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 224, 24)========= sending heartbeat 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size========= sending heartbeat 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 2007, 81) 218 block size 256 grid size (500, 126, 1) global compute_resid_pow with (34, 1, 2007, 81) 218 block size 256 grid size (34, 126, 1) global compute_resid_pow with (34, 1, 672, 44) 894 block size 256 grid size (34, 42, 1) global compute_resid_pow with (34, 1, 224, 24) 3604 block size 256 grid size (34, 14, 1) global compute_resid_pow with (34, 1, 80, 12) 14456 block size 256 grid size (34, 5, 1) global compute_resid_pow with (34, 1, 32, 8) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 16, 4) 26424 block size 256 grid size (34, 1, 1) global compute_resid_pow with (34, 1, 8, 4) 26424 block size 128 grid size (34, 8, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) global compute_resid_pow with (34, 1, 19, 21) 26424 block size 256 grid size (34, 2, 1) exception in force_free_cufft_plan: exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: global compute_resid_pow with (500, 1, 672, 44) 894 block size 256 grid size (500, 42, 1) global compute_resid_pow with (500, 1, 224, 24) 3604 block size 256 grid size (500, 14, 1) global compute_resid_pow with (500, 1, 80, 12) 14456 block size 256 grid size (500, 5, 1) global compute_resid_pow with (500, 1, 32, 8) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 16, 4) 26424 block size 256 grid size (500, 1, 1) global compute_resid_pow with (500, 1, 8, 4) 26424 block size 128 grid size (500, 8, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) global compute_resid_pow with (500, 1, 19, 21) 26424 block size 256 grid size (500, 2, 1) exception in cufft.Plan.__del__: FSC No-Mask... ========= sending heartbeat 0.143 at 105.796 radwn. 0.5 at 96.642 radwn. Took 2.306s. FSC Spherical Mask... 0.143 at 109.876 radwn. 0.5 at 99.831 radwn. Took 2.921s. FSC Loose Mask... ========= sending heartbeat 0.143 at 120.696 radwn. 0.5 at 105.710 radwn. Took 11.886s. FSC Tight Mask... ========= sending heartbeat 0.143 at 130.152 radwn. 0.5 at 111.999 radwn. Took 10.704s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 130.191 radwn. 0.5 at 111.536 radwn. Took 21.540s. ---- Computing FSC with mask 2.00 to 6.00 FSC No-Mask... ========= sending heartbeat 0.143 at 105.796 radwn. 0.5 at 96.642 radwn. Took 1.949s. FSC Spherical Mask... 0.143 at 109.876 radwn. 0.5 at 99.831 radwn. Took 2.753s. FSC Loose Mask... ========= sending heartbeat 0.143 at 120.696 radwn. 0.5 at 105.710 radwn. Took 10.093s. FSC Tight Mask... ========= sending heartbeat 0.143 at 135.398 radwn. 0.5 at 118.504 radwn. Took 10.645s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 132.688 radwn. 0.5 at 112.688 radwn. Took 21.158s. ---- Computing FSC with mask 2.25 to 7.00 FSC No-Mask... ========= sending heartbeat 0.143 at 105.796 radwn. 0.5 at 96.642 radwn. Took 1.954s. FSC Spherical Mask... 0.143 at 109.876 radwn. 0.5 at 99.831 radwn. Took 2.787s. FSC Loose Mask... ========= sending heartbeat 0.143 at 120.696 radwn. 0.5 at 105.710 radwn. Took 9.774s. FSC Tight Mask... ========= sending heartbeat 0.143 at 133.595 radwn. 0.5 at 116.925 radwn. Took 10.051s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 132.251 radwn. 0.5 at 113.717 radwn. Took 21.600s. ---- Computing FSC with mask 2.50 to 8.00 FSC No-Mask... 0.143 at 105.796 radwn. 0.5 at 96.642 radwn. Took 1.931s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 109.876 radwn. 0.5 at 99.831 radwn. Took 2.753s. FSC Loose Mask... ========= sending heartbeat 0.143 at 120.696 radwn. 0.5 at 105.710 radwn. Took 9.811s. FSC Tight Mask... ========= sending heartbeat 0.143 at 132.942 radwn. 0.5 at 115.588 radwn. Took 9.870s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 132.797 radwn. 0.5 at 113.363 radwn. Took 19.888s. ---- Computing FSC with mask 2.75 to 9.00 FSC No-Mask... 0.143 at 105.796 radwn. 0.5 at 96.642 radwn. Took 1.974s. FSC Spherical Mask... ========= sending heartbeat 0.143 at 109.876 radwn. 0.5 at 99.831 radwn. Took 3.072s. FSC Loose Mask... ========= sending heartbeat 0.143 at 120.696 radwn. 0.5 at 105.710 radwn. Took 9.820s. FSC Tight Mask... ========= sending heartbeat 0.143 at 132.495 radwn. 0.5 at 114.376 radwn. Took 10.042s. FSC Noise Sub... ========= sending heartbeat ========= sending heartbeat 0.143 at 132.375 radwn. 0.5 at 113.360 radwn. Took 21.512s. ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat ========= sending heartbeat HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: HOST ALLOCATION FUNCTION: using cudrv.pagelocked_empty exception in cufft.Plan.__del__: exception in cufft.Plan.__del__: *************************************************************** /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/deps/anaconda/envs/cryosparc_worker_env/lib/python3.7/multiprocessing/process.py:99: MatplotlibDeprecationWarning: Passing non-integers as three-element position specification is deprecated since 3.3 and will be removed two minor releases later. self._target(*self._args, **self._kwargs) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:945: RuntimeWarning: invalid value encountered in true_divide fsc_true = (fsc_t - fsc_n) / (1.0 - fsc_n) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/sigproc.py:649: FutureWarning: `rcond` parameter will change to the default of machine precision times ``max(M, N)`` where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass `rcond=None`, to keep using the old, explicitly pass `rcond=-1`. x = n.linalg.lstsq(w.reshape((-1,1))*A, w*b)[0] /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:285: RuntimeWarning: divide by zero encountered in log logabs = n.log(n.abs(fM)) /data/software/cryosparc/cryosparc2_worker/cryosparc_compute/plotutil.py:27: RuntimeWarning: invalid value encountered in sqrt cradwn = n.sqrt(cradwn) ========= main process now complete. ========= monitor process now complete.