To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 沃??蘖?????鳶??耶??兀??藥?? 100101111000000000111111001111111001111101010000001111110011111100111111001111110011111110010011110011100011111100111111100101101110101100111111001111111001100101011001001111110011111111100101010110100011111100111111 97803f3f9f503f3f3f3f3f93ce3f3f96eb3f3f99593f3fe55a3f3f
EUC-JP 沃??蘖?????鳶??耶??兀??藥?? 110011011110000000111111001111111101110110110001001111110011111100111111001111110011111111000110110100000011111100111111110011001110110100111111001111111101000110111010001111110011111111101001101110110011111100111111 cde03f3fddb13f3f3f3f3fc6d03f3fcced3f3fd1ba3f3fe9bb3f3f
UTF-8 沃겼겢蘖띹갬掠욃춼鳶멨뜵耶섋꽦兀덂컟藥썼갬 111001101011001010000011111010101011001010111100111010101011001010100010111010001001100010010110111010111001110110111001111010101011000010101100111011111010010110110101111011001001101010000011111011001011011010111100111010011011001110110110111010111010100110101000111010111001110010110101111010001000000010110110111011001000010010001011111010101011110110100110111001011000010110000000111010111000110110000010111011001011101110011111111010001001011110100101111011001000110110111100111010101011000010101100 e6b283eab2bceab2a2e89896eb9db9eab0acefa5b5ec9a83ecb6bce9b3b6eba9a8eb9cb5e880b6ec848beabda6e58580eb8d82ecbb9fe897a5ec8dbceab0ac
UHC 沃겼겢蘖띹갬掠욃춼鳶멨뜵耶섋꽦兀덂컟藥썼갬 111010001010101010110000111001011000000110110100111001011110111010001101111010001011000010110111111001011011000110011110111001011010110110011000111001101110100110111000111001011000110110110011111001011010110110011000111010001000010010110001111010001011010010001000111001011011000010001010111001011011011110111101111010001011000010110111 e8aab0e581b4e5ee8de8b0b7e5b19ee5ad98e6e9b8e58db3e5ad98e884b1e8b488e5b08ae5b7bde8b0b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)