To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蘖??役?蘖??役?^ 100111110101000000111111001111111001011011110000001111111001111101010000001111110011111110010110111100000011111101011110 9f503f3f96f03f9f503f3f96f03f5e
EUC-JP 蘖??役?蘖??役?^ 110111011011000100111111001111111100110011110010001111111101110110110001001111110011111111001100111100100011111101011110 ddb13f3fccf23fddb13f3fccf23f5e
UTF-8 蘖붾튉役쥵蘖붾튉役쥵^ 11101000100110001001011011101011101101101011111011101101100010101000100111100101101111011011100111101100101001011011010111101000100110001001011011101011101101101011111011101101100010101000100111100101101111011011100111101100101001011011010101011110 e89896ebb6beed8a89e5bdb9eca5b5e89896ebb6beed8a89e5bdb9eca5b55e
UHC 蘖붾튉役쥵蘖붾튉役쥵^ 111001011110111010010100111010111011100110011101111001101011010110100011010001001110010111101110100101001110101110111001100111011110011010110101101000110100010001011110 e5ee94ebb99de6b5a344e5ee94ebb99de6b5a3445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)