What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋??玉??雅??譯??節?塋??玉??雅 1001101011001000001111110011111110001011110010100011111100111111100010011110101100111111001111111110011010100001001111110011111110010000110111110011111110011010110010000011111100111111100010111100101000111111001111111000100111101011 9ac83f3f8bca3f3f89eb3f3fe6a13f3f90df3f9ac83f3f8bca3f3f89eb
EUC-JP 塋??玉??雅??譯??節?塋??玉??雅 1101010011001010001111110011111110110110110011000011111100111111101100101110110100111111001111111110110010100011001111110011111111000000111000010011111111010100110010100011111100111111101101101100110000111111001111111011001011101101 d4ca3f3fb6cc3f3fb2ed3f3feca33f3fc0e13fd4ca3f3fb6cc3f3fb2ed
UTF-8 塋뉛슁玉붺뵽雅딂예譯귨슬節킾塋뉛슁玉붺뵽雅 111001011010000110001011111010111000100110011011111011001000101010000001111001111000111010001001111010111011011010111010111010111011010110111101111010011001101110000101111010111001010010000010111011001001100010001000111010001010110110101111111010101011011110101000111011001000101010101100111001111010111110000000111011011000001010111110111001011010000110001011111010111000100110011011111011001000101010000001111001111000111010001001111010111011011010111010111010111011010110111101111010011001101110000101 e5a18beb899bec8a81e78e89ebb6baebb5bde99b85eb9482ec9888e8adafeab7a8ec8aace7af80ed82bee5a18beb899bec8a81e78e89ebb6baebb5bde99b85
UHC 塋뉛슁玉붺뵽雅딂예譯귨슬節킾塋뉛슁玉붺뵽雅 111001111010101110000111111011111011110110110011111010001010110010010100111001111001010010111011111001001011101010001010111010001011111110111001111001101011101110000010111011111011110110111101111011111011110110110101011010001110011110101011100001111110111110111101101100111110100010101100100101001110011110010100101110111110010010111010 e7ab87efbdb3e8ac94e794bbe4ba8ae8bfb9e6bb82efbdbdefbdb568e7ab87efbdb3e8ac94e794bbe4ba

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
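
The same kind of conversion can be sketched offline in Python. This is only an illustration under stated assumptions, not the code behind this page: the helper name `encode_table` is hypothetical, the charset labels are mapped to Python's standard codec names (with UHC assumed to correspond to `cp949`), and characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to match the 3f runs in the table above.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is Python's cp949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("雅"):
        print(label, bits, hex_string)
```

For the single character 雅, this sketch yields 89eb for SJIS-Win, b2ed for EUC-JP, and e99b85 for UTF-8, the same byte sequences that appear for that character in the table rows above.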