To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 萬駈終茫縁蒐茫縁蒐D萬駈終茫縁蒐茫縁蒐D^ 111001001101110110001011111011011000111101001001111001001010100110001001100011111000111101001110111001001010100110001001100011111000111101001110010001001110010011011101100010111110110110001111010010011110010010101001100010011000111110001111010011101110010010101001100010011000111110001111010011100100010001011110 e4dd8bed8f49e4a9898f8f4ee4a9898f8f4e44e4dd8bed8f49e4a9898f8f4ee4a9898f8f4e445e
EUC-JP 萬駈終茫縁蒐茫縁蒐D萬駈終茫縁蒐茫縁蒐D^ 111010001101111110110110111011111011110110101010111010001010101110110001111011111011110110101111111010001010101110110001111011111011110110101111010001001110100011011111101101101110111110111101101010101110100010101011101100011110111110111101101011111110100010101011101100011110111110111101101011110100010001011110 e8dfb6efbdaae8abb1efbdafe8abb1efbdaf44e8dfb6efbdaae8abb1efbdafe8abb1efbdaf445e
UTF-8 萬駈終茫縁蒐茫縁蒐D萬駈終茫縁蒐茫縁蒐D^ 111010001001000010101100111010011010011110001000111001111011010110000010111010001000110010101011111001111011100010000001111010001001001010010000111010001000110010101011111001111011100010000001111010001001001010010000010001001110100010010000101011001110100110100111100010001110011110110101100000101110100010001100101010111110011110111000100000011110100010010010100100001110100010001100101010111110011110111000100000011110100010010010100100000100010001011110 e890ace9a788e7b582e88cabe7b881e89290e88cabe7b881e8929044e890ace9a788e7b582e88cabe7b881e89290e88cabe7b881e89290445e
UHC 萬?終茫?蒐茫?蒐D萬?終茫?蒐茫?蒐D^ 110110001011111100111111111100001111101111011000110101000011111111100010101111011101100011010100001111111110001010111101010001001101100010111111001111111111000011111011110110001101010000111111111000101011110111011000110101000011111111100010101111010100010001011110 d8bf3ff0fbd8d43fe2bdd8d43fe2bd44d8bf3ff0fbd8d43fe2bdd8d43fe2bd445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)