To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 賊?倚ⅴ賊?倚ⅴ^ 100100011010111100111111100110001101111111111010010001001001000110101111001111111001100011011111111110100100010001011110 91af3f98dffa4491af3f98dffa445e
EUC-JP 賊?倚?賊?倚?^ 11000010101100010011111111010000111000010011111111000010101100010011111111010000111000010011111101011110 c2b13fd0e13fc2b13fd0e13f5e
UTF-8 賊렠倚ⅴ賊렠倚ⅴ^ 11101000101100111000101011101011101000001010000011100101100000001001101011100010100001011011010011101000101100111000101011101011101000001010000011100101100000001001101011100010100001011011010001011110 e8b38aeba0a0e5809ae285b4e8b38aeba0a0e5809ae285b45e
UHC 賊렠倚ⅴ賊렠倚ⅴ^ 1110111011100100100011101011000111101011111011111010010110100101111011101110010010001110101100011110101111101111101001011010010101011110 eee48eb1ebefa5a5eee48eb1ebefa5a55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)