To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ◆??ヂ????ぁ莎ァ??ケ??ぱ◆??ヂ? 10000001100111110011111100111111100000110110000100111111001111110011111100111111100000101001111111100100101100111000001101000000001111110011111110000011010100000011111100111111100000101100111110000001100111110011111100111111100000110110000100111111 819f3f3f83613f3f3f3f829fe4b383403f3f83503f3f82cf819f3f3f83613f
EUC-JP ◆??ヂ????ぁ莎ァ??ケ??ぱ◆??ヂ? 10100010101000010011111100111111101001011100001000111111001111110011111100111111101001001010000111101000101101011010010110100001001111110011111110100101101100010011111100111111101001001101000110100010101000010011111100111111101001011100001000111111 a2a13f3fa5c23f3f3f3fa4a1e8b5a5a13f3fa5b13f3fa4d1a2a13f3fa5c23f
UTF-8 ◆룶죴ヂ룶즽㈒룶ぁ莎ァ룴횕ケ룶첂ぱ◆룶죴ヂ룶 111000101001011110000110111010111010001110110110111011001010001110110100111000111000001110000010111010111010001110110110111011001010011010111101111000111000100010010010111010111010001110110110111000111000000110000001111010001000111010001110111000111000001010100001111010111010001110110100111011011001101010010101111000111000001010110001111010111010001110110110111011001011001010000010111000111000000110110001111000101001011110000110111010111010001110110110111011001010001110110100111000111000001110000010111010111010001110110110 e29786eba3b6eca3b4e38382eba3b6eca6bde38892eba3b6e38181e88e8ee382a1eba3b4ed9a95e382b1eba3b6ecb282e381b1e29786eba3b6eca3b4e38382eba3b6
UHC ◆룶죴ヂ룶즽㈒룶ぁ莎ァ룴횕ケ룶첂ぱ◆룶죴ヂ룶 1010000111011111100011111010101110100001100011111010101111000010100011111010101110100011100011111010100111000011100011111010101110101010101000011101111011101101101010111010000110001111101010011100001110001111101010111011000110001111101010111010101010001111101010101101000110100001110111111000111110101011101000011000111110101011110000101000111110101011 a1df8faba18fabc28faba38fa9c38fabaaa1deedaba18fa9c38fabb18fabaa8faad1a1df8faba18fabc28fab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)