Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 汐暹汐鮮汐暹汐鮮[汐暹汐鮮汐暹汐鮮[^ 1000111010101100100111011111100110001110101011001001000101001110100011101010110010011101111110011000111010101100100100010100111001011011100011101010110010011101111110011000111010101100100100010100111010001110101011001001110111111001100011101010110010010001010011100101101101011110 8eac9df98eac914e8eac9df98eac914e5b8eac9df98eac914e8eac9df98eac914e5b5e
EUC-JP 汐暹汐鮮汐暹汐鮮[汐暹汐鮮汐暹汐鮮[^ 1011110010101110110110101111101110111100101011101100000110101111101111001010111011011010111110111011110010101110110000011010111101011011101111001010111011011010111110111011110010101110110000011010111110111100101011101101101011111011101111001010111011000001101011110101101101011110 bcaedafbbcaec1afbcaedafbbcaec1af5bbcaedafbbcaec1afbcaedafbbcaec1af5b5e
UTF-8 汐暹汐鮮汐暹汐鮮[汐暹汐鮮汐暹汐鮮[^ 111001101011000110010000111001101001101010111001111001101011000110010000111010011010111010101110111001101011000110010000111001101001101010111001111001101011000110010000111010011010111010101110010110111110011010110001100100001110011010011010101110011110011010110001100100001110100110101110101011101110011010110001100100001110011010011010101110011110011010110001100100001110100110101110101011100101101101011110 e6b190e69ab9e6b190e9aeaee6b190e69ab9e6b190e9aeae5be6b190e69ab9e6b190e9aeaee6b190e69ab9e6b190e9aeae5b5e
UHC 汐暹汐鮮汐暹汐鮮[汐暹汐鮮汐暹汐鮮[^ 1110000010110001111000001110011111100000101100011110000011011000111000001011000111100000111001111110000010110001111000001101100001011011111000001011000111100000111001111110000010110001111000001101100011100000101100011110000011100111111000001011000111100000110110000101101101011110 e0b1e0e7e0b1e0d8e0b1e0e7e0b1e0d85be0b1e0e7e0b1e0d8e0b1e0e7e0b1e0d85b5e
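
The same kind of table can be reproduced offline. The following is a minimal Python sketch, not the code behind this page: the `CODECS` mapping from the table's labels to Python codec names and the `encode_all` helper are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row above suggests.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_all("汐[^").items():
        print(label, bits, hexa)
```

For the single character 汐, the UTF-8 entry comes out as `e6b190`, matching the first three bytes of the UTF-8 row above.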

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).