To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ??檣?日橋? 00111111001111111001111011111100001111111001001111111010100010111011010000111111 3f3f9efc3f93fa8bb43f
EUC-JP 佾?檣?日橋? 100011111011000011111011001111111101110011111110001111111100011011111100101101101011011000111111 8fb0fb3fdcfe3fc6fcb6b63f
UTF-8 佾렋檣렫日橋률 111001001011110110111110111010111010000010001011111001101010101010100011111010111010000010101011111001101001011110100101111001101010100110001011111010111010010110100000 e4bdbeeba08be6aaa3eba0abe697a5e6a98beba5a0
UHC 佾렋檣렫日橋률 1110110011101011100011101010001011101101111010101000111010111001111011001110110111001110111010011011011111111100 eceb8ea2edea8eb9ecedcee9b7fc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)