To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????r[????????r[^ 001111110011111100111111001111110011111100111111001111110011111101110010010110110011111100111111001111110011111100111111001111110011111100111111011100100101101101011110 3f3f3f3f3f3f3f3f725b3f3f3f3f3f3f3f3f725b5e
SJIS-WIN 娃??渦??厓?r[娃??渦??厓?r[^ 100010001010000100111111001111111000100101010001001111110011111111111010100011010011111101110010010110111000100010100001001111110011111110001001010100010011111100111111111110101000110100111111011100100101101101011110 88a13f3f89513f3ffa8d3f725b88a13f3f89513f3ffa8d3f725b5e
EUC-JP 娃??渦??厓?r[娃??渦??厓?r[^ 1011000010100011001111110011111110110001101100100011111100111111100011111011010011000111001111110111001001011011101100001010001100111111001111111011000110110010001111110011111110001111101101001100011100111111011100100101101101011110 b0a33f3fb1b23f3f8fb4c73f725bb0a33f3fb1b23f3f8fb4c73f725b5e
UTF-8 娃붷춼渦경뿦厓쯉r[娃붷춼渦경뿦厓쯉r[^ 1110010110101000100000111110101110110110101101111110110010110110101111001110011010111000101001101110101010110010101111011110101110111111101001101110010110001110100100111110110010101111100010010111001001011011111001011010100010000011111010111011011010110111111011001011011010111100111001101011100010100110111010101011001010111101111010111011111110100110111001011000111010010011111011001010111110001001011100100101101101011110 e5a883ebb6b7ecb6bce6b8a6eab2bdebbfa6e58e93ecaf89725be5a883ebb6b7ecb6bce6b8a6eab2bdebbfa6e58e93ecaf89725b5e
UHC 娃붷춼渦경뿦厓쯉r[娃붷춼渦경뿦厓쯉r[^ 11101000110111111001010011100101101011011001100011101000101111101011000011100110100101111010011011100100111011011010100101000101011100100101101111101000110111111001010011100101101011011001100011101000101111101011000011100110100101111010011011100100111011011010100101000101011100100101101101011110 e8df94e5ad98e8beb0e697a6e4eda945725be8df94e5ad98e8beb0e697a6e4eda945725b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)