To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????{N}????????{N{^ 0011111100111111001111110011111100111111001111110011111100111111011110110100111001111101001111110011111100111111001111110011111100111111001111110011111101111011010011100111101101011110 3f3f3f3f3f3f3f3f7b4e7d3f3f3f3f3f3f3f3f7b4e7b5e
SJIS-WIN ??贄?????{N}??贄?????{N{^ 00111111001111111110011011010001001111110011111100111111001111110011111101111011010011100111110100111111001111111110011011010001001111110011111100111111001111110011111101111011010011100111101101011110 3f3fe6d13f3f3f3f3f7b4e7d3f3fe6d13f3f3f3f3f7b4e7b5e
EUC-JP ??贄?????{N}??贄?????{N{^ 00111111001111111110110011010011001111110011111100111111001111110011111101111011010011100111110100111111001111111110110011010011001111110011111100111111001111110011111101111011010011100111101101011110 3f3fecd33f3f3f3f3f7b4e7d3f3fecd33f3f3f3f3f7b4e7b5e
UTF-8 렱렚贄샬솽렱렚샴{N}렱렚贄샬솽렱렚샴{N{^ 11101011101000001011000111101011101000001001101011101000101101001000010011101100100000111010110011101100100001101011110111101011101000001011000111101011101000001001101011101100100000111011010001111011010011100111110111101011101000001011000111101011101000001001101011101000101101001000010011101100100000111010110011101100100001101011110111101011101000001011000111101011101000001001101011101100100000111011010001111011010011100111101101011110 eba0b1eba09ae8b484ec83acec86bdeba0b1eba09aec83b47b4e7deba0b1eba09ae8b484ec83acec86bdeba0b1eba09aec83b47b4e7b5e
UHC 렱렚贄샬솽렱렚샴{N}렱렚贄샬솽렱렚샴{N{^ 100011101011111010001110101011011111001010111110101111001010001110111100111000011000111010111110100011101010110110111100101001000111101101001110011111011000111010111110100011101010110111110010101111101011110010100011101111001110000110001110101111101000111010101101101111001010010001111011010011100111101101011110 8ebe8eadf2bebca3bce18ebe8eadbca47b4e7d8ebe8eadf2bebca3bce18ebe8eadbca47b4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)