To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 涯??猷??悠 | 10001010010101010011111100111111100101110101000100111111001111111001011101001001 | 8a553f3f97513f3f9749
EUC-JP | 涯?ʼn猷??悠 | 101100111011011000111111100011111010100111001010110011011011001000111111001111111100110110101010 | b3b63f8fa9cacdb23f3fcdaa
UTF-8 | 涯쇰ʼn猷뚪썚悠 | 1110011010110110101011111110110010000111101100001100010110001001111001111000110010110111111010111001101010101010111011001000110110011010111001101000001010100000 | e6b6afec87b0c589e78cb7eb9aaaec8d9ae682a0
UHC | 涯쇰ʼn猷뚪썚悠 | 1110010011110011101111001110101110101001101100001110101110100011100011001110100110011011100011011110101011101101 | e4f3bceba9b0eba38ce99b8deaed
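A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` is hypothetical, and the codec names `cp932` (for SJIS-WIN) and `cp949` (for UHC) are assumed to be the closest Python equivalents. Characters a charset cannot represent are replaced with `?` (0x3f), as in the rows above. Python's codecs may map a few characters differently from this page, so the output is not guaranteed to match byte for byte.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Codec names are assumptions: cp932 ~ SJIS-WIN, cp949 ~ UHC.
DEFAULT_CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=DEFAULT_CHARSETS):
    """Return a list of (charset, binary string, hex string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_table("涯"):
        print(name, bits, hexa)
```

For the single character 涯, the UTF-8 row of this sketch gives `e6b6af`, which agrees with the first three bytes of the UTF-8 row in the table.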

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).