What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????k 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6b
SJIS-WIN 逶門嶋逍ス逵ゥ逖∫料逶門嶋逍ス逵ゥ逖∫怱k 11100111100110111001011011100101100100111000100011100111100101101011110111100111100111001010100111100111100110001000000111100111100101111011111111100111100110111001011011100101100100111000100011100111100101101011110111100111100111001010100111100111100110001000000111100111100111001000010001101011 e79b96e59388e796bde79ca9e79881e797bfe79b96e59388e796bde79ca9e79881e79c846b
EUC-JP 逶門嶋逍ス逵ゥ逖∫料逶門嶋逍ス逵ゥ逖∫怱k 1110110111111011110011001110011111000101111010001110110111110110100011101011110111101101111111001000111010101001111011011111100010100010111010011100111011000001111011011111101111001100111001111100010111101000111011011111011010001110101111011110110111111100100011101010100111101101111110001010001011101001110101111110010001101011 edfbcce7c5e8edf68ebdedfc8ea9edf8a2e9cec1edfbcce7c5e8edf68ebdedfc8ea9edf8a2e9d7e46b
UTF-8 逶門嶋逍ス逵ゥ逖∫料逶門嶋逍ス逵ゥ逖∫怱k 11101001100000001011011011101001100101101000000011100101101101101000101111101001100000001000110111101111101111011011110111101001100000001011010111101111101111011010100111101001100000001001011011100010100010001010101111100110100101101001100111101001100000001011011011101001100101101000000011100101101101101000101111101001100000001000110111101111101111011011110111101001100000001011010111101111101111011010100111101001100000001001011011100010100010001010101111100110100000001011000101101011 e980b6e99680e5b68be9808defbdbde980b5efbda9e98096e288abe69699e980b6e99680e5b68be9808defbdbde980b5efbda9e98096e288abe680b16b
UHC ?門嶋逍?逵??∫料?門嶋逍?逵??∫?k 0011111111011010101001101101001111110111111000011100111000111111110100001011000000111111001111111010000111110010110101101111100100111111110110101010011011010011111101111110000111001110001111111101000010110000001111110011111110100001111100100011111101101011 3fdaa6d3f7e1ce3fd0b03f3fa1f2d6f93fdaa6d3f7e1ce3fd0b03f3fa1f23f6b

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
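
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name to_bit_strings and the CODECS mapping are hypothetical, and the mapping from the table's labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1 and UHC rows. Python's euc_jp codec may differ from this tool's EUC-JP for some vendor-extension characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あk"  # any short string works here
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("k", "utf-8") returns ("01101011", "6b"), the same trailing byte that ends every row of the table.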