What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 偲萍上ナ萍上ソ 10001110110000111110010011001010100011111110001111110001111100001100010111100100110010101000111111100011111100011110111110111111 8ec3e4ca8fe3f1f0c5e4ca8fe3f1efbf
EUC-JP 偲萍上?ナ萍上?ソ 10111100110001011110100011001100101111101110010100111111100011101100010111101000110011001011111011100101001111111000111010111111 bcc5e8ccbee53f8ec5e8ccbee53f8ebf
UTF-8 偲萍上ナ萍上ソ 111001011000000110110010111010001001000010001101111001001011100010001010111011101000010110101011111011111011111010000101111010001001000010001101111001001011100010001010111011101000010110101010111011111011110110111111 e581b2e8908de4b88aee85abefbe85e8908de4b88aee85aaefbdbf
UHC ?萍上??萍上?? 00111111111110001100001111011111101111100011111100111111111110001100001111011111101111100011111100111111 3ff8c3dfbe3f3ff8c3dfbe3f3f

SJIS-Win, EUC-JP: legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
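
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page. The codec names (cp932 for SJIS-Win, cp949 for UHC) are assumptions about the closest Python equivalents, and the helper name to_bitstrings is made up for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the ISO-8859-1 row above, but other rows may differ in detail from what this page produces.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def to_bitstrings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


# Print one row per charset, in the same column order as the table.
sample = "あ"
for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bitstrings(sample, codec)
    encoded_view = sample.encode(codec, errors="replace").decode(codec)
    print(label, encoded_view, binary, hexadecimal)
```

For example, to_bitstrings("あ", "utf-8") returns ("111000111000000110000010", "e38182"), while the same character under ISO-8859-1 comes back as a single 3f byte because that charset has no Japanese characters.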