Beyond Autoregressive: Exploring Alternative Architectures for LLMs