What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?®???????B 00111111101011100011111100111111001111110011111100111111001111110011111101000010 3fae3f3f3f3f3f3f3f42
SJIS-WIN 馭??鸚?????B 111010010110011000111111001111111110101001011111001111110011111100111111001111110011111101000010 e9663f3fea5f3f3f3f3f3f42
EUC-JP 馭®?鸚?????B 1111000111000111100011111010001011101110001111111111001111000000001111110011111100111111001111110011111101000010 f1c78fa2ee3ff3c03f3f3f3f3f42
UTF-8 馭®몓鸚룡걿烈곩ㅇB 111010011010011010101101110000101010111011101011101010101001001111101001101110001001101011101011101000111010000111101010101100011011111111101111101001101001111111101010101100111010100111100011100001011000011101000010 e9a6adc2aeebaa93e9b89aeba3a1eab1bfefa69feab3a9e3858742
UHC 馭®몓鸚룡걿烈곩ㅇB 11100101110111111010001011100111100100011000000111100101101001001011011111100110100000011010001011100110111011111000000111100101101001001011011101000010 e5dfa2e79181e5a4b7e681a2e6ef81e5a4b742

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.