Maximizing Utilization of Global Memory Bandwidth