To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?暴??蓬睹?恨宏?暴??蓬睹?恨槐^ 0011111110010110010111000011111100111111100101100100100011100001110011110011111110001101101001101000110101000111001111111001011001011100001111110011111110010110010010001110000111001111001111111000110110100110100111101100010101011110 3f965c3f3f9648e1cf3f8da68d473f965c3f3f9648e1cf3f8da69ec55e
EUC-JP ?暴??蓬睹?恨宏?暴??蓬睹?恨槐^ 0011111111001011101111010011111100111111110010111010100111100010110100010011111110111010101010001011100110101000001111111100101110111101001111110011111111001011101010011110001011010001001111111011101010101000110111001100011101011110 3fcbbd3f3fcba9e2d13fbaa8b9a83fcbbd3f3fcba9e2d13fbaa8dcc75e
UTF-8 뤋暴쭗샘蓬睹뤋恨宏뤋暴쭗샘蓬睹뤋恨槐^ 11101011101001001000101111100110100110101011010011101100101011011001011111101100100000111001100011101000100100111010110011100111100111011011100111101011101001001000101111100110100000011010100011100101101011101000111111101011101001001000101111100110100110101011010011101100101011011001011111101100100000111001100011101000100100111010110011100111100111011011100111101011101001001000101111100110100000011010100011100110101001111001000001011110 eba48be69ab4ecad97ec8398e893ace79db9eba48be681a8e5ae8feba48be69ab4ecad97ec8398e893ace79db9eba48be681a8e6a7905e
UHC 뤋暴쭗샘蓬睹뤋恨宏뤋暴쭗샘蓬睹뤋恨槐^ 10001111101110111111100011101100101001111000111110111011111110011101110011101111110101001010100110001111101110111111100111001111110011101101101110001111101110111111100011101100101001111000111110111011111110011101110011101111110101001010100110001111101110111111100111001111110011101101100101011110 8fbbf8eca78fbbf9dcefd4a98fbbf9cfcedb8fbbf8eca78fbbf9dcefd4a98fbbf9cfced95e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
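
The conversion shown in the table can be approximated with a minimal Python sketch. This is not the page's actual implementation. The mapping from the table's charset labels to Python codec names (in particular "SJIS-WIN" to cp932 and "UHC" to cp949) is an assumption, as are the helper names encode_row and convert. Characters that a charset cannot represent are replaced with "?" (3f), which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f) rather than
    raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("暴^").items():
        print(label, bits, hexstr)
```

For the two-character input "暴^", the UTF-8 row comes out as e69ab45e and the ISO-8859-1 row as 3f5e, consistent with the byte values for those characters in the table above.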