Which bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????D 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f44
SJIS-WIN 癲ル?悠??醫??語⑥????袁??肄?Ⅹ鷹??D 1110000110011111100000111000101100111111100101110100100100111111001111111110011111001110001111110011111110001100111010101000011101000101001111110011111100111111001111111110010111001101001111110011111111100011111001010011111110000111010111011001000111101001001111110011111101000100 e19f838b3f97493f3fe7ce3f3f8cea87453f3f3f3fe5cd3f3fe3e53f875d91e93f3f44
EUC-JP 癲ル?悠??醫??語?????袁??肄??鷹??D 111000101010000110100101111010110011111111001101101010100011111100111111111011101101000000111111001111111011100011101100001111110011111100111111001111110011111111101010110011110011111100111111111001101110011100111111001111111100001011101011001111110011111101000100 e2a1a5eb3fcdaa3f3feed03f3fb8ec3f3f3f3f3feacf3f3fe6e73f3fc2eb3f3f44
UTF-8 癲ル슓悠㏆쭗醫댿봼語⑥궢留뗨짅袁딄쑨肄욑Ⅹ鷹낇뇠D 11100111100110011011001011100011100000111010101111101100100010101001001111100110100000101010000011100011100011111000011011101100101011011001011111101001100001101010101111101011100011001011111111101011101101001011110011101000101010101001111011100010100100011010010111101010101101101010001011101111101001111000110111101011100101111010100011101100101001111000010111101000101000101000000111101011100101001000010011101100100100011010100011101000100000101000010011101100100110101001000111100010100001011010100111101001101101111011100111101011100000101000011111101011100001111010000001000100 e799b2e383abec8a93e682a0e38f86ecad97e986abeb8cbfebb4bce8aa9ee291a5eab6a2efa78deb97a8eca785e8a281eb9484ec91a8e88284ec9a91e285a9e9b7b9eb8287eb87a044
UHC 癲ル슓悠㏆쭗醫댿봼語⑥궢留뗨짅袁딄쑨肄욑Ⅹ鷹낇뇠D 11101111101001101010101111101011100110101010001011101010111011011010011111101111101001111000111111101100101000101000100011100010100101001000001111100101110111101010100011101100100000101011010111101011101001111000101111101000101000111001010011101010101111101000101011101010101111101010011111101100101111011001111011101111101001011011100111101011111011011000010111101101100001111000100001000100 efa6abeb9aa2eaeda7efa78feca288e29483e5dea8ec82b5eba78be8a394eabe8aeabea7ecbd9eefa5b9ebed85ed878844

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
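
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the runs of 3f in the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset label, bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("D"):
        print(label, bits, hexa)
```

For the single ASCII letter "D", every charset listed yields the same byte, 01000100 (hex 44), which matches the final byte of each row in the table.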