What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????d}?????????d{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110010001111101001111110011111100111111001111110011111100111111001111110011111100111111011001000111101101011110 3f3f3f3f3f3f3f3f3f647d3f3f3f3f3f3f3f3f3f647b5e
SJIS-WIN 只陌????錚??d}只陌????錚??d{^ 1001000111111100111010001001100100111111001111110011111100111111111010000100001000111111001111110110010001111101100100011111110011101000100110010011111100111111001111110011111111101000010000100011111100111111011001000111101101011110 91fce8993f3f3f3fe8423f3f647d91fce8993f3f3f3fe8423f3f647b5e
EUC-JP 只陌????錚??d}只陌????錚??d{^ 1100001011111110111011111111100100111111001111110011111100111111111011111010001100111111001111110110010001111101110000101111111011101111111110010011111100111111001111110011111111101111101000110011111100111111011001000111101101011110 c2feeff93f3f3f3fefa33f3f647dc2feeff93f3f3f3fefa33f3f647b5e
UTF-8 只陌렞희렰렮錚띈㉡d}只陌렞희렰렮錚띈㉡d{^ 1110010110001111101010101110100110011001100011001110101110100000100111101110110110011101101011001110101110100000101100001110101110100000101011101110100110001100100110101110101110011101100010001110001110001001101000010110010001111101111001011000111110101010111010011001100110001100111010111010000010011110111011011001110110101100111010111010000010110000111010111010000010101110111010011000110010011010111010111001110110001000111000111000100110100001011001000111101101011110 e58faae9998ceba09eed9daceba0b0eba0aee98c9aeb9d88e389a1647de58faae9998ceba09eed9daceba0b0eba0aee98c9aeb9d88e389a1647b5e
UHC 只陌렞희렰렮錚띈㉡d}只陌렞희렰렮錚띈㉡d{^ 1111000111111110110110001110100010001110101011111100100011110001100011101011110110001110101110111110111010110110101101101110100010101000101100100110010001111101111100011111111011011000111010001000111010101111110010001111000110001110101111011000111010111011111011101011011010110110111010001010100010110010011001000111101101011110 f1fed8e88eafc8f18ebd8ebbeeb6b6e8a8b2647df1fed8e88eafc8f18ebd8ebbeeb6b6e8a8b2647b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
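
A conversion of this kind can be approximated in Python. The following is only a minimal sketch, not the code behind this page. The mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and the names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("d{^").items():
        print(label, bits, hexstr)
```

Pure ASCII input such as "d{^" yields the same bytes (647b5e) in all five charsets, which is why every row of the table ends with the same hexadecimal suffix. Results for non-ASCII input may differ slightly from the table, since codec implementations do not always agree on edge-case mappings.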