What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 暗??裕?暗??裕?^ 100010001100001100111111001111111001011101010100001111111000100011000011001111110011111110010111010101000011111101011110 88c33f3f97543f88c33f3f97543f5e
EUC-JP 暗??裕?暗??裕?^ 101100001100010100111111001111111100110110110101001111111011000011000101001111110011111111001101101101010011111101011110 b0c53f3fcdb53fb0c53f3fcdb53f5e
UTF-8 暗삳렇裕뉲暗삳렇裕뉲^ 11100110100110101001011111101100100000101011001111101011101000001000011111101000101000111001010111101011100010011011001011100110100110101001011111101100100000101011001111101011101000001000011111101000101000111001010111101011100010011011001001011110 e69a97ec82b3eba087e8a395eb89b2e69a97ec82b3eba087e8a395eb89b25e
UHC 暗삳렇裕뉲暗삳렇裕뉲^ 111001001101111010111011111010111011011110111000111010111010111010001000010001001110010011011110101110111110101110110111101110001110101110101110100010000100010001011110 e4debbebb7b8ebae8844e4debbebb7b8ebae88445e
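
The rows above can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`CODECS`) and the helper `encode_row` are assumptions made for illustration, and Python's codecs may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hexadecimal string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "暗^"
    for label, codec in CODECS.items():
        bits, hexed = encode_row(sample, codec)
        print(label, sample, bits, hexed)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.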

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).