To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???^h???^fN}???^h???^fN{^ 00111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111110100111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111101101011110 3f3f3f5e683f3f3f5e664e7d3f3f3f5e683f3f3f5e664e7b5e
SJIS-WIN 舌?齷^h舌?齷^fN}舌?齷^h舌?齷^fN{^ 100100001110001100111111111010101001100101011110011010001001000011100011001111111110101010011001010111100110011001001110011111011001000011100011001111111110101010011001010111100110100010010000111000110011111111101010100110010101111001100110010011100111101101011110 90e33fea995e6890e33fea995e664e7d90e33fea995e6890e33fea995e664e7b5e
EUC-JP 舌?齷^h舌?齷^fN}舌?齷^h舌?齷^fN{^ 110000001110010100111111111100111111100101011110011010001100000011100101001111111111001111111001010111100110011001001110011111011100000011100101001111111111001111111001010111100110100011000000111001010011111111110011111110010101111001100110010011100111101101011110 c0e53ff3f95e68c0e53ff3f95e664e7dc0e53ff3f95e68c0e53ff3f95e664e7b5e
UTF-8 舌說齷^h舌說齷^fN}舌說齷^h舌說齷^fN{^ 11101000100010001000110011101000101010101010101011101001101111011011011101011110011010001110100010001000100011001110100010101010101010101110100110111101101101110101111001100110010011100111110111101000100010001000110011101000101010101010101011101001101111011011011101011110011010001110100010001000100011001110100010101010101010101110100110111101101101110101111001100110010011100111101101011110 e8888ce8aaaae9bdb75e68e8888ce8aaaae9bdb75e664e7de8888ce8aaaae9bdb75e68e8888ce8aaaae9bdb75e664e7b5e
UHC 舌說齷^h舌說齷^fN}舌說齷^h舌說齷^fN{^ 11100000110111111110000011100011111001001100101101011110011010001110000011011111111000001110001111100100110010110101111001100110010011100111110111100000110111111110000011100011111001001100101101011110011010001110000011011111111000001110001111100100110010110101111001100110010011100111101101011110 e0dfe0e3e4cb5e68e0dfe0e3e4cb5e664e7de0dfe0e3e4cb5e68e0dfe0e3e4cb5e664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)