What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????C?????Ni?????C?????NiB 001111110011111100111111001111110011111101000011001111110011111100111111001111110011111101001110011010010011111100111111001111110011111100111111010000110011111100111111001111110011111100111111010011100110100101000010 3f3f3f3f3f433f3f3f3f3f4e693f3f3f3f3f433f3f3f3f3f4e6942
SJIS-WIN 肛????C肛????Ni肛????C肛????NiB 11100011111010000011111100111111001111110011111101000011111000111110100000111111001111110011111100111111010011100110100111100011111010000011111100111111001111110011111101000011111000111110100000111111001111110011111100111111010011100110100101000010 e3e83f3f3f3f43e3e83f3f3f3f4e69e3e83f3f3f3f43e3e83f3f3f3f4e6942
EUC-JP 肛?勖??C肛?勖??Ni肛?勖??C肛?勖??NiB 111001101110101000111111100011111011001111101101001111110011111101000011111001101110101000111111100011111011001111101101001111110011111101001110011010011110011011101010001111111000111110110011111011010011111100111111010000111110011011101010001111111000111110110011111011010011111100111111010011100110100101000010 e6ea3f8fb3ed3f3f43e6ea3f8fb3ed3f3f4e69e6ea3f8fb3ed3f3f43e6ea3f8fb3ed3f3f4e6942
UTF-8 肛렚勖쾌백C肛렚勖쾌백Ni肛렚勖쾌백C肛렚勖쾌백NiB 11101000100000101001101111101011101000001001101011100101100010111001011011101100101111101000110011101011101100001011000101000011111010001000001010011011111010111010000010011010111001011000101110010110111011001011111010001100111010111011000010110001010011100110100111101000100000101001101111101011101000001001101011100101100010111001011011101100101111101000110011101011101100001011000101000011111010001000001010011011111010111010000010011010111001011000101110010110111011001011111010001100111010111011000010110001010011100110100101000010 e8829beba09ae58b96ecbe8cebb0b143e8829beba09ae58b96ecbe8cebb0b14e69e8829beba09ae58b96ecbe8cebb0b143e8829beba09ae58b96ecbe8cebb0b14e6942
UHC 肛렚勖쾌백C肛렚勖쾌백Ni肛렚勖쾌백C肛렚勖쾌백NiB 1111100111111101100011101010110111101001111011011100010011101000101110011110100101000011111110011111110110001110101011011110100111101101110001001110100010111001111010010100111001101001111110011111110110001110101011011110100111101101110001001110100010111001111010010100001111111001111111011000111010101101111010011110110111000100111010001011100111101001010011100110100101000010 f9fd8eade9edc4e8b9e943f9fd8eade9edc4e8b9e94e69f9fd8eade9edc4e8b9e943f9fd8eade9edc4e8b9e94e6942

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
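
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper name to_bit_strings is made up for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the table, but the exact bytes for some characters may differ from this page because codec implementations vary.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the
    # target charset cannot represent.
    data = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("Ni", codec)
        print(label, binary, hexadecimal)
```

For ASCII-only input such as "Ni", all five charsets produce the same bytes (4e 69), since each of them is ASCII-compatible for these characters; the rows only diverge for non-ASCII characters.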