To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 癲???ょ?音??n}癲???ょ?音??n{^ 1110000110011111001111110011111100111111100000101110010100111111100010011011100100111111001111110110111001111101111000011001111100111111001111110011111110000010111001010011111110001001101110010011111100111111011011100111101101011110 e19f3f3f3f82e53f89b93f3f6e7de19f3f3f3f82e53f89b93f3f6e7b5e
EUC-JP 癲???ょ?音??n}癲???ょ?音??n{^ 1110001010100001001111110011111100111111101001001110011100111111101100101011101100111111001111110110111001111101111000101010000100111111001111110011111110100100111001110011111110110010101110110011111100111111011011100111101101011110 e2a13f3f3fa4e73fb2bb3f3f6e7de2a13f3f3fa4e73fb2bb3f3f6e7b5e
UTF-8 癲뽰뇯痢ょ땸音얩닂n}癲뽰뇯痢ょ땸音얩닂n{^ 1110011110011001101100101110101110111101101100001110101110000111101011111110111110100111101001011110001110000010100001111110101110010101101110001110100110011111101100111110110010010110101010011110101110001011100000100110111001111101111001111001100110110010111010111011110110110000111010111000011110101111111011111010011110100101111000111000001010000111111010111001010110111000111010011001111110110011111011001001011010101001111010111000101110000010011011100111101101011110 e799b2ebbdb0eb87afefa7a5e38287eb95b8e99fb3ec96a9eb8b826e7de799b2ebbdb0eb87afefa7a5e38287eb95b8e99fb3ec96a9eb8b826e7b5e
UHC 癲뽰뇯痢ょ땸音얩닂n}癲뽰뇯痢ょ땸音얩닂n{^ 1110111110100110100101101110110010000111100101001110110010111000101010101110011110001011100011101110101111100101101111101110110110001000100010110110111001111101111011111010011010010110111011001000011110010100111011001011100010101010111001111000101110001110111010111110010110111110111011011000100010001011011011100111101101011110 efa696ec8794ecb8aae78b8eebe5beed888b6e7defa696ec8794ecb8aae78b8eebe5beed888b6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)