To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 逵ゥ逖∝嶋逵ゥ逖∫ゥゥ逵ゥ逖∝嶋逵ゥ逖∫ゥゥB 11100111100111001010100111100111100110001000000111100101100100111000100011100111100111001010100111100111100110001000000111100111101010011010100111100111100111001010100111100111100110001000000111100101100100111000100011100111100111001010100111100111100110001000000111100111101010011010100101000010 e79ca9e79881e59388e79ca9e79881e7a9a9e79ca9e79881e59388e79ca9e79881e7a9a942
EUC-JP 逵ゥ逖∝嶋逵ゥ逖∫ゥゥ逵ゥ逖∝嶋逵ゥ逖∫ゥゥB 111011011111110010001110101010011110110111111000101000101110011111000101111010001110110111111100100011101010100111101101111110001010001011101001100011101010100110001110101010011110110111111100100011101010100111101101111110001010001011100111110001011110100011101101111111001000111010101001111011011111100010100010111010011000111010101001100011101010100101000010 edfc8ea9edf8a2e7c5e8edfc8ea9edf8a2e98ea98ea9edfc8ea9edf8a2e7c5e8edfc8ea9edf8a2e98ea98ea942
UTF-8 逵ゥ逖∝嶋逵ゥ逖∫ゥゥ逵ゥ逖∝嶋逵ゥ逖∫ゥゥB 11101001100000001011010111101111101111011010100111101001100000001001011011100010100010001001110111100101101101101000101111101001100000001011010111101111101111011010100111101001100000001001011011100010100010001010101111101111101111011010100111101111101111011010100111101001100000001011010111101111101111011010100111101001100000001001011011100010100010001001110111100101101101101000101111101001100000001011010111101111101111011010100111101001100000001001011011100010100010001010101111101111101111011010100111101111101111011010100101000010 e980b5efbda9e98096e2889de5b68be980b5efbda9e98096e288abefbda9efbda9e980b5efbda9e98096e2889de5b68be980b5efbda9e98096e288abefbda9efbda942
UHC 逵??∝嶋逵??∫??逵??∝嶋逵??∫??B 110100001011000000111111001111111010000111110000110100111111011111010000101100000011111100111111101000011111001000111111001111111101000010110000001111110011111110100001111100001101001111110111110100001011000000111111001111111010000111110010001111110011111101000010 d0b03f3fa1f0d3f7d0b03f3fa1f23f3fd0b03f3fa1f0d3f7d0b03f3fa1f23f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)