To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???偃??厓よ????偃??榮??泳?? 001111110011111100111111100110001110111000111111001111111111101010001101100000101110011000111111001111110011111100111111100110001110111000111111001111111001111011000100001111110011111110001001011010100011111100111111 3f3f3f98ee3f3ffa8d82e63f3f3f3f98ee3f3f9ec43f3f896a3f3f
EUC-JP ???偃??厓よ????偃??榮??泳?? 00111111001111110011111111010000111100000011111100111111100011111011010011000111101001001110100000111111001111110011111100111111110100001111000000111111001111111101110011000110001111110011111110110001110010110011111100111111 3f3f3fd0f03f3f8fb4c7a4e83f3f3f3fd0f03f3fdcc63f3fb1cb3f3f
UTF-8 遼깅젨偃띾젿厓よ쾴溜롫젨偃띾젿榮녿젩泳롰벢 111011111010011110000011111010101011100110000101111011001010000010101000111001011000000110000011111010111001110110111110111011001010000010111111111001011000111010010011111000111000001010001000111011001011111010110100111011111010011110001011111010111010000110101011111011001010000010101000111001011000000110000011111010111001110110111110111011001010000010111111111001101010011010101110111010111000010110111111111011001010000010101001111001101011001110110011111010111010000110110000111010111011001010100010 efa783eab985eca0a8e58183eb9dbeeca0bfe58e93e38288ecbeb4efa78beba1abeca0a8e58183eb9dbeeca0bfe6a6aeeb85bfeca0a9e6b3b3eba1b0ebb2a2
UHC 遼깅젨偃띾젿厓よ쾴溜롫젨偃띾젿榮녿젩泳롰벢 111010011010110010110001111010111010000010100000111001011110011110001101111010111010000010110001111001001110110110101010111010001011001010001010111010101111111010001110111010111010000010100000111001011110011110001101111010111010000010110001111001111011010010000110111010111010000010100001111001111011011010001110111011011001001110111011 e9acb1eba0a0e5e78deba0b1e4edaae8b28aeafe8eeba0a0e5e78deba0b1e7b486eba0a1e7b68eed93bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)