To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 鳥??同??雰肌 100100101011100100111111001111111001001110101111001111110011111110010101101101011001010010100111 92b93f3f93af3f3f95b594a7
EUC-JP 鳥??同庾?雰肌 1100010010111011001111110011111111000110101100011000111110111100110011100011111111001010101101111100100010101001 c4bb3f3fc6b18fbcce3fcab7c8a9
UTF-8 鳥렫뤯同庾퍘雰肌 111010011011001110100101111010111010000010101011111010111010010010101111111001011001000010001100111001011011101010111110111011011000110110011000111010011001101110110000111010001000001010001100 e9b3a5eba0abeba4afe5908ce5babeed8d98e99bb0e8828c
UHC 鳥렫뤯同庾퍘雰肌 11110000111010001000111010111001100011111101110111010100110100101110101011101100101110111000111111011101110101001101000110111111 f0e88eb98fddd4d2eaecbb8fddd4d1bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)