To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????~B 0011111100111111001111110011111100111111001111110111111001000010 3f3f3f3f3f3f7e42
SJIS-WIN 熬??悅λ?~B 1110000010010010001111110011111111111010101111011000001111001001001111110111111001000010 e0923f3ffabd83c93f7e42
EUC-JP 熬???λ?~B 11011111111100100011111100111111001111111010011011001011001111110111111001000010 dff23f3f3fa6cb3f7e42
UTF-8 熬곷젒悅λ젻~B 11100111100001101010110011101010101100111011011111101100101000001001001011100110100000101000010111001110101110111110110010100000101110110111111001000010 e786aceab3b7eca092e68285cebbeca0bb7e42
UHC 熬곷젒悅λ젻~B 1110100010100010100000011110101110100000100100011110011011101101101001011110101110100000101011100111111001000010 e8a281eba091e6eda5eba0ae7e42
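The rows above can be approximated offline with a short Python sketch. It is only an illustration, not the code behind this page: the mapping from the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, as is the use of `errors="replace"` to stand in for the `3f` ("?") bytes the table shows for unencodable characters. The helper names `encode_row` and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the
    # target charset cannot represent.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_table("λ~B").items():
        print(label, bits, hex_digits)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.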

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).