To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????C 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f43
SJIS-WIN ??????????????????野??C 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111001011011101100001111110011111101000011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f96ec3f3f43
EUC-JP ??????????????????野??C 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111100110011101110001111110011111101000011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fccee3f3f43
UTF-8 溜삳젣溜븀츧留ㅻ젣溜뷩뀛溜쒕졎溜믩졋野띄꽇C 11101111101001111000101111101100100000101011001111101100101000001010001111101111101001111000101111101011101110001000000011101100101110001010011111101111101001111000110111100011100001011011101111101100101000001010001111101111101001111000101111101011101101111010100111101011100000001001101111101111101001111000101111101100100100101001010111101100101000011000111011101111101001111000101111101011101011111010100111101100101000011000101111101001100001111000111011101011100111011000010011101010101111011000011101000011 efa78bec82b3eca0a3efa78bebb880ecb8a7efa78de385bbeca0a3efa78bebb7a9eb809befa78bec9295eca18eefa78bebafa9eca18be9878eeb9d84eabd8743
UHC 溜삳젣溜븀츧留ㅻ젣溜뷩뀛溜쒕졎溜믩졋野띄꽇C 11101010111111101011101111101011101000001001110011101010111111101011101011100111101011101001110111101011101001111010010011101011101000001001110011101010111111101011101011100011100001011001010011101010111111101001110011101011101000001011101111101010111111101001001011101011101000001011101011100101101011111011011011100111100001001001100101000011 eafebbeba09ceafebae7ae9deba7a4eba09ceafebae38594eafe9ceba0bbeafe92eba0bae5afb6e7849943

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)