To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????U??U}?????U??U{^ 001111110011111100111111001111110011111101010101001111110011111101010101011111010011111100111111001111110011111100111111010101010011111100111111010101010111101101011110 3f3f3f3f3f553f3f557d3f3f3f3f3f553f3f557b5e
SJIS-WIN 嶸ィ巐啄乘U嶸ゥU}嶸ィ巐啄乘U嶸ゥU{^ 11111010101101001010100011111010101101101001000111101101100110001010100101010101111110101011010010101001010101010111110111111010101101001010100011111010101101101001000111101101100110001010100101010101111110101011010010101001010101010111101101011110 fab4a8fab691ed98a955fab4a9557dfab4a8fab691ed98a955fab4a9557b5e
EUC-JP 嶸ィ巐啄乘U嶸ゥU}嶸ィ巐啄乘U嶸ゥU{^ 1000111110111011111101001000111010101000100011111011101111111001110000101110111111010000101010110101010110001111101110111111010010001110101010010101010101111101100011111011101111110100100011101010100010001111101110111111100111000010111011111101000010101011010101011000111110111011111101001000111010101001010101010111101101011110 8fbbf48ea88fbbf9c2efd0ab558fbbf48ea9557d8fbbf48ea88fbbf9c2efd0ab558fbbf48ea9557b5e
UTF-8 嶸ィ巐啄乘U嶸ゥU}嶸ィ巐啄乘U嶸ゥU{^ 11100101101101101011100011101111101111011010100011100101101101111001000011100101100101011000010011100100101110011001100001010101111001011011011010111000111011111011110110101001010101010111110111100101101101101011100011101111101111011010100011100101101101111001000011100101100101011000010011100100101110011001100001010101111001011011011010111000111011111011110110101001010101010111101101011110 e5b6b8efbda8e5b790e59584e4b99855e5b6b8efbda9557de5b6b8efbda8e5b790e59584e4b99855e5b6b8efbda9557b5e
UHC 嶸??啄乘U嶸?U}嶸??啄乘U嶸?U{^ 1110011110101110001111110011111111110110111100101110001110101011010101011110011110101110001111110101010101111101111001111010111000111111001111111111011011110010111000111010101101010101111001111010111000111111010101010111101101011110 e7ae3f3ff6f2e3ab55e7ae3f557de7ae3f3ff6f2e3ab55e7ae3f557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)