To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????惟?????油??碎??耶??肉? 1110000110011111001111110011111100111111001111110011111110001000110100100011111100111111001111110011111100111111100101101111101100111111001111111110000111101010001111110011111110010110111010110011111100111111100100111111011100111111 e19f3f3f3f3f3f88d23f3f3f3f3f96fb3f3fe1ea3f3f96eb3f3f93f73f
EUC-JP 癲?????惟?????油??碎??耶??肉? 1110001010100001001111110011111100111111001111110011111110110000110101000011111100111111001111110011111100111111110011001111110100111111001111111110001011101100001111110011111111001100111011010011111100111111110001101111100100111111 e2a13f3f3f3f3fb0d43f3f3f3f3fccfd3f3fe2ec3f3fcced3f3fc6f93f
UTF-8 癲앷퀣杻볠꼷惟㏉뫛濾낅쉥油꿴굫碎밸탟耶껊돆肉묪 111001111001100110110010111011001001010110110111111011011000000010100011111011111010011110001000111010111011001110100000111010101011110010110111111001101000001110011111111000111000111110001001111010111010101110011011111011111010011010000100111010111000001010000101111011001000100110100101111001101011001010111001111010101011111110110100111010101011010110101011111001111010001010001110111010111011000010111000111011011000001110011111111010001000000010110110111010101011101110001010111010111000111110000110111010001000001010001001111010111010110010101010 e799b2ec95b7ed80a3efa788ebb3a0eabcb7e6839fe38f89ebab9befa684eb8285ec89a5e6b2b9eabfb4eab5abe7a28eebb0b8ed839fe880b6eabb8aeb8f86e88289ebacaa
UHC 癲앷퀣杻볠꼷惟㏉뫛濾낅쉥油꿴굫碎밸탟耶껊돆肉묪 11101111101001101001110111101010101100111001011111101010111101001001001111100110100001001000111111101010111011101010011111101101100100011011101111100110101001001000010111101011101111011010101111101010111110101011001011101001100000101001000111100001111011111011100111101011101101011000001111100101101011011000001111101011100010011001011111101011101111111001001001000010 efa69deab397eaf493e6848feaeea7ed91bbe6a485ebbdabeafab2e98291e1efb9ebb583e5ad83eb8997ebbf9242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)