To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????`^SB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101100000010111100101001101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f605e5342
SJIS-WIN 耶??癌?????癌??耶??癌??`^SB 100101101110101100111111001111111000101011100000001111110011111100111111001111110011111110001010111000000011111100111111100101101110101100111111001111111000101011100000001111110011111101100000010111100101001101000010 96eb3f3f8ae03f3f3f3f3f8ae03f3f96eb3f3f8ae03f3f605e5342
EUC-JP 耶??癌?????癌??耶??癌??`^SB 110011001110110100111111001111111011010011100010001111110011111100111111001111110011111110110100111000100011111100111111110011001110110100111111001111111011010011100010001111110011111101100000010111100101001101000010 cced3f3fb4e23f3f3f3f3fb4e23f3fcced3f3fb4e23f3f605e5342
UTF-8 耶⑸젦癌뺣젻若노젶癌뺣젽耶⑸젫癌뺣젧`^SB 11101000100000001011011011100010100100011011100011101100101000001010011011100111100110011000110011101011101110101010001111101100101000001011101111101111101001011011010011101011100001011011100011101100101000001011011011100111100110011000110011101011101110101010001111101100101000001011110111101000100000001011011011100010100100011011100011101100101000001010101111100111100110011000110011101011101110101010001111101100101000001010011101100000010111100101001101000010 e880b6e291b8eca0a6e7998cebbaa3eca0bbefa5b4eb85b8eca0b6e7998cebbaa3eca0bde880b6e291b8eca0abe7998cebbaa3eca0a7605e5342
UHC 耶⑸젦癌뺣젻若노젶癌뺣젽耶⑸젫癌뺣젧`^SB 11100101101011011010100111101011101000001001111011100100110111111001010111101011101000001010111011100101101011101011001111101011101000001010101011100100110111111001010111101011101000001010111111100101101011011010100111101011101000001010001111100100110111111001010111101011101000001001111101100000010111100101001101000010 e5ada9eba09ee4df95eba0aee5aeb3eba0aae4df95eba0afe5ada9eba0a3e4df95eba09f605e5342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)