To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 狹豎狹瞿狹豎狹豬 11100000110000111110011010110001111000001100001111100001110110001110000011000011111001101011000111100000110000111110011010110101 e0c3e6b1e0c3e1d8e0c3e6b1e0c3e6b5
EUC-JP 狹豎狹瞿狹豎狹豬 11100000110001011110110010110011111000001100010111100010110110101110000011000101111011001011001111100000110001011110110010110111 e0c5ecb3e0c5e2dae0c5ecb3e0c5ecb7
UTF-8 狹豎狹瞿狹豎狹豬 111001111000101110111001111010001011000110001110111001111000101110111001111001111001111010111111111001111000101110111001111010001011000110001110111001111000101110111001111010001011000110101100 e78bb9e8b18ee78bb9e79ebfe78bb9e8b18ee78bb9e8b1ac
UHC 狹?狹瞿狹?狹? 11111010111101010011111111111010111101011100111110111010111110101111010100111111111110101111010100111111 faf53ffaf5cfbafaf53ffaf53f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)