What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 將?障?吟?弔?垣?鬱頭彬B 10011011100100100011111110001111111000010011111110001011111000010011111110010010101000100011111110001010010111110011111110011111010101001001001110101010100101010110101001000010 9b923f8fe13f8be13f92a23f8a5f3f9f5493aa956a42
EUC-JP 將?障?吟?弔?垣?鬱頭彬B 11010101111100100011111110111110111000110011111110110110111000110011111111000100101001000011111110110011110000000011111111011101101101011100011010101100110010011100101101000010 d5f23fbee33fb6e33fc4a43fb3c03fddb5c6acc9cb42
UTF-8 將렚障렚吟렞弔렲垣렖鬱頭彬B 11100101101100001000011111101011101000001001101011101001100110101001110011101011101000001001101011100101100100001001111111101011101000001001111011100101101111001001010011101011101000001011001011100101100111101010001111101011101000001001011011101001101011001011000111101001101000001010110111100101101111011010110001000010 e5b087eba09ae99a9ceba09ae5909feba09ee5bc94eba0b2e59ea3eba096e9acb1e9a0ade5bdac42
UHC 將렚障렚吟렞弔렲垣렖鬱頭彬B 111011011110001010001110101011011110111010100001100011101010110111101011111000011000111010101111111100001100000010001110101111111110101010101111100011101010101111101010101001101101010011101001110111101010111101000010 ede28eadeea18eadebe18eaff0c08ebfeaaf8eabeaa6d4e9deaf42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
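
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `to_bit_strings` and `convert` are hypothetical, and the codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to be the closest Python equivalents, so a few vendor-specific mappings may differ. Unencodable characters are replaced with "?" via `errors="replace"`.

```python
# Map the table's charset labels to Python codec names.
# These pairings are an assumption: cp932 ~ SJIS-WIN, cp949 ~ UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in the table."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("頭B").items():
        print(label, binary, hexadecimal)
```

For the input "頭B", this should yield `93aa42` for SJIS-WIN, `c6ac42` for EUC-JP, and `e9a0ad42` for UTF-8, which agrees with the corresponding bytes in the table above. ISO-8859-1 cannot represent 頭, so it yields `3f42`.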