To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 擾→?焉??節 1000111111101111100000011010100000111111111000001000000100111111001111111001000011011111 8fef81a83fe0813f3f90df
EUC-JP 擾→?焉??節 1011111011110001101000101010101000111111110111111110000100111111001111111100000011100001 bef1a2aa3fdfe13f3fc0e1
UTF-8 擾→닁焉숃쪓節 111001101001001110111110111000101000011010010010111010111000101110000001111001111000010010001001111011001000100010000011111011001010101010010011111001111010111110000000 e693bee28692eb8b81e78489ec8883ecaa93e7af80
UHC 擾→닁焉숃쪓節 1110100011110110101000011110011010001000100010101110010111101010100110011110100010100101100011011110111110111101 e8f6a1e6888ae5ea99e8a58defbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)