To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鵝??茹g?仰???????ぃ泳?????^ 11101010010000000011111100111111111001001010010110000010100001110011111110001011110000100011111100111111001111110011111100111111001111110011111110000010101000011000100101101010001111110011111100111111001111110011111101011110 ea403f3fe4a582873f8bc23f3f3f3f3f3f3f82a1896a3f3f3f3f3f5e
EUC-JP 鵝??茹g?仰??轝????ぃ泳?????^ 111100111010000100111111001111111110100010100111101000111110011100111111101101101100010000111111001111111000111111100001101010100011111100111111001111110011111110100100101000111011000111001011001111110011111100111111001111110011111101011110 f3a13f3fe8a7a3e73fb6c43f3f8fe1aa3f3f3f3fa4a3b1cb3f3f3f3f3f5e
UTF-8 鵝싲젉茹g텚仰뜻쓻轝뽨퐱溜뗦ぃ泳볟댍呂뽪쵔^ 11101001101101011001110111101100100010111011001011101100101000001000100111101000100011001011100111101111101111011000011111101101100001011001101011100100101110111011000011101011100111001011101111101100100100111011101111101000101111011001110111101011101111011010100011101101100100001011000111101111101001111000101111101011100101111010011011100011100000011000001111100110101100111011001111101011101100111001111111101011100011001000110111101111101001101000000011101011101111011010101011101100101101011001010001011110 e9b59dec8bb2eca089e88cb9efbd87ed859ae4bbb0eb9cbbec93bbe8bd9debbda8ed90b1efa78beb97a6e38183e6b3b3ebb39feb8c8defa680ebbdaaecb5945e
UHC 鵝싲젉茹g텚仰뜻쓻轝뽨퐱溜뗦ぃ泳볟댍呂뽪쵔^ 11100100101111011001101011101011101000001000101111100110101010101010001111100111101101101001001111100100111001101011011011100110100111011001011011100110101011001001011011100100101111011001101011101010111111101000101111100110101010101010001111100111101101101001001111100101100010001011011011100101111110111001011011100110101011001001011001011110 e4bd9aeba08be6aaa3e7b693e4e6b6e69d96e6ac96e4bd9aeafe8be6aaa3e7b693e588b6e5fb96e6ac965e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)