To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蝓腫※骰м蝓腫※骰к^ 111001011001111110001110111011101000000110100110111010011000110110000100011111011110010110011111100011101110111010000001101001101110100110001101100001000111101101011110 e59f8eee81a6e98d847de59f8eee81a6e98d847b5e
EUC-JP 蝓腫※骰м蝓腫※骰к^ 111010101010000110111100111100001010001010101000111100011110110110100111110111101110101010100001101111001111000010100010101010001111000111101101101001111101110001011110 eaa1bcf0a2a8f1eda7deeaa1bcf0a2a8f1eda7dc5e
UTF-8 蝓腫※骰м蝓腫※骰к^ 1110100010011101100100111110100010000101101010111110001010000000101110111110100110101010101100001101000010111100111010001001110110010011111010001000010110101011111000101000000010111011111010011010101010110000110100001011101001011110 e89d93e885abe280bbe9aab0d0bce89d93e885abe280bbe9aab0d0ba5e
UHC ?腫※?м?腫※?к^ 0011111111110000111111101010000111011000001111111010110011011110001111111111000011111110101000011101100000111111101011001101110001011110 3ff0fea1d83facde3ff0fea1d83facdc5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
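
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_all` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the ISO-8859-1 and UHC rows show. Legacy codecs can differ between implementations for some characters, so results may not match the table in every case.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' rather than raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_all(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_all("^").items():
        print(label, bits, hexstr)
```

For example, `encode_row("^", "utf-8")` gives the bit string `01011110` and the hexadecimal string `5e`, matching the final byte of every row in the table.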