To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 趁壌åò´ 111010001011011010000001111001011010001110001100111001011111001010110100 e8b681e5a38ce5f2b4
SJIS-WIN ?¶??£???´ 001111111000000111110111001111110011111110000001100100100011111100111111001111111000000101001100 3f81f73f3f81923f3f3f814c
EUC-JP è¶?å£?åò´ 1000111110101011101100101010001011111001001111111000111110101011101010011010000111110010001111111000111110101011101010011000111110101011110100101010000110101101 8fabb2a2f93f8faba9a1f23f8faba98fabd2a1ad
UTF-8 趁壌åò´ 110000111010100011000010101101101100001010000001110000111010010111000010101000111100001010001100110000111010010111000011101100101100001010110100 c3a8c2b6c281c3a5c2a3c28cc3a5c3b2c2b4
UHC ?¶??????´ 0011111110100010110100100011111100111111001111110011111100111111001111111010001010100101 3fa2d23f3f3f3f3f3fa2a5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)