Category: Reading

  • Weight-sparse transformers have interpretable circuits

    Sparse models contain small, disentangled circuits that are both understandable and sufficient to perform the behavior. Overall setup. Plot of interpretability versus capability. Quotation source: Understanding neural networks through sparse circuits. Reference: Gao, Leo, Achyuta Rajaram, Jacob Coxon, Soham V. Govande, Bowen Baker, and Dan Mossing. “Weight-sparse transformers have interpretable circuits.” arXiv preprint…

  • Research Papers Manifestation

    As a researcher myself, reading papers is one part of the job. Although some resources and materials present ways to read papers, I sometimes still feel that they are not practical and do not help me digest the papers properly. Before going further, I need to recommend the book “How to read a…