To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?????????甸???????????^ 0011111100111111001111110011111100111111001111110011111100111111001111111001100110110010001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f99b23f3f3f3f3f3f3f3f3f3f3f5e
EUC-JP ?????????甸?????薏?????^ 00111111001111110011111100111111001111110011111100111111001111110011111111010010101101000011111100111111001111110011111100111111100011111101100111011110001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3fd2b43f3f3f3f3f8fd9de3f3f3f3f3f5e
UTF-8 溜삳젍溜살쓻溜뗫졎甸묐졎溜롫졎薏뽯젉溜삼쨵^ 11101111101001111000101111101100100000101011001111101100101000001000110111101111101001111000101111101100100000101011010011101100100100111011101111101111101001111000101111101011100101111010101111101100101000011000111011100111100101001011100011101011101011001001000011101100101000011000111011101111101001111000101111101011101000011010101111101100101000011000111011101000100101101000111111101011101111011010111111101100101000001000100111101111101001111000101111101100100000101011110011101100101010001011010101011110 efa78bec82b3eca08defa78bec82b4ec93bbefa78beb97abeca18ee794b8ebac90eca18eefa78beba1abeca18ee8968febbdafeca089efa78bec82bceca8b55e
UHC 溜삳젍溜살쓻溜뗫졎甸묐졎溜롫졎薏뽯젉溜삼쨵^ 11101010111111101011101111101011101000001000111011101010111111101011101111101100100111011001011011101010111111101000101111101011101000001011101111101111101001001001000111101011101000001011101111101010111111101000111011101011101000001011101111101011111110111001011011101011101000001000101111101010111111101011101111101111101001001000111101011110 eafebbeba08eeafebbec9d96eafe8beba0bbefa491eba0bbeafe8eeba0bbebfb96eba08beafebbefa48f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)