To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????] 00111111001111110011111100111111001111110011111100111111001111110011111101011101 3f3f3f3f3f3f3f3f3f5d
SJIS-WIN 堯??魚??絶??] 11101010100111110011111100111111100010111001101100111111001111111001000011100010001111110011111101011101 ea9f3f3f8b9b3f3f90e23f3f5d
EUC-JP 堯??魚??絶??] 11110100101000010011111100111111101101011111101100111111001111111100000011100100001111110011111101011101 f4a13f3fb5fb3f3fc0e43f3f5d
UTF-8 堯억푵魚됮넠絶붻뻗] 11100101101000001010111111101100100101101011010111101101100100011011010111101001101011011001101011101011100100001010111011101011100001001010000011100111101101011011011011101011101101101011101111101011101110111001011101011101 e5a0afec96b5ed91b5e9ad9aeb90aeeb84a0e7b5b6ebb6bbebbb975d
UHC 堯억푵魚됮넠絶붻뻗] 11101000111010111011111011101111101111101000001111100101111000001000100111101001100001101010010011101111101111101001010011101000101110111011100001011101 e8ebbeefbe83e5e089e986a4efbe94e8bbb85d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)