To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 馭??媛??遺?? 111010010110011000111111001111111001010101010001001111110011111110001000111000100011111100111111 e9663f3f95513f3f88e23f3f
EUC-JP 馭??媛??遺?? 111100011100011100111111001111111100100110110010001111110011111110110000111001000011111100111111 f1c73f3fc9b23f3fb0e43f3f
UTF-8 馭곥룂媛쇔㎤遺우컝 111010011010011010101101111010101011001110100101111010111010001110000010111001011010101010011011111011001000011110010100111000111000111010100100111010011000000110111010111011001001101010110000111011001011101110011101 e9a6adeab3a5eba382e5aa9bec8794e38ea4e981baec9ab0ecbb9d
UHC 馭곥룂媛쇔㎤遺우컝 111001011101111110000001111000111000111110000011111010101011000010111100111001011010011110101000111010111011011010111111111011001011000010001000 e5df81e38f83eab0bce5a7a8ebb6bfecb088

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)