What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???巡??矣??冗??萸??醫??夭?? 001111110011111100111111100011111000010000111111001111111110000111100001001111110011111110001111111001110011111100111111111001001100111000111111001111111110011111001110001111110011111110011010111011100011111100111111 3f3f3f8f843f3fe1e13f3f8fe73f3fe4ce3f3fe7ce3f3f9aee3f3f
EUC-JP ???巡??矣??冗??萸??醫??夭?? 001111110011111100111111101111011110010000111111001111111110001011100011001111110011111110111110111010010011111100111111111010001101000000111111001111111110111011010000001111110011111111010100111100000011111100111111 3f3f3fbde43f3fe2e33f3fbee93f3fe8d03f3feed03f3fd4f03f3f
UTF-8 麗몃쓹巡붻린矣낅룆冗밸맪萸룟죰醫딆퐟夭곗걖 111011111010011010001000111010111010101010000011111011001001001110111001111001011011011110100001111010111011011010111011111010111010011010110000111001111001111110100011111010111000001010000101111010111010001110000110111001011000011010010111111010111011000010111000111010111010011110101010111010001001000010111000111010111010001110011111111011001010001110110000111010011000011010101011111010111001010010000110111011011001000010011111111001011010010010101101111010101011001110010111111010101011000110010110 efa688ebaa83ec93b9e5b7a1ebb6bbeba6b0e79fa3eb8285eba386e58697ebb0b8eba7aae890b8eba39feca3b0e986abeb9486ed909fe5a4adeab397eab196
UHC 麗몃쓹巡붻린矣낅룆冗밸맪萸룟죰醫딆퐟夭곗걖 111001101011000010111000111010111001110110010101111000101101111010010100111010001011100010110000111010111111100010000101111010111000111110000101111010011011011110111001111010111001000010110010111010111010110110110111111001011010000110001011111011001010001010001010111011001011110110001000111010001110110010110000111011001000000110000001 e6b0b8eb9d95e2de94e8b8b0ebf885eb8f85e9b7b9eb90b2ebadb7e5a18beca28aecbd88e8ecb0ec8181
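The rows above can be reproduced offline with a short script. The following is a minimal sketch, not the converter's actual implementation: the Python codec names in `CODECS` are assumed equivalents of the charsets in the table (`cp932` for SJIS-WIN, `cp949` for UHC), and `to_bit_strings` is a hypothetical helper name. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for name, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("巡", codec)
        print(name, binary, hexadecimal)
```

For the character 巡, this gives `8f84` under `cp932` and `bde4` under `euc_jp`, the same byte pairs that appear in the SJIS-WIN and EUC-JP rows.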

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
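
Because the two charsets assign different bytes to the same character, bytes must be decoded with the charset that produced them. The sketch below illustrates this with the byte pairs for 巡 taken from the table; `decode_hex` is a hypothetical helper, and `cp932` and `euc_jp` are the Python codec names assumed to correspond to SJIS-Win and EUC-JP.

```python
def decode_hex(hex_string, codec):
    """Decode a hexadecimal byte string using the given codec."""
    # errors="replace" yields U+FFFD for byte sequences invalid in the codec.
    return bytes.fromhex(hex_string).decode(codec, errors="replace")


if __name__ == "__main__":
    # Matching charset: both decode to the same character.
    print(decode_hex("8f84", "cp932"))
    print(decode_hex("bde4", "euc_jp"))
    # Mismatched charset: the result is garbled text, not the original character.
    print(repr(decode_hex("8f84", "euc_jp")))
    print(repr(decode_hex("bde4", "cp932")))
```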