What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 螂ェ蜊伜ー願セソ邯壼ア槫キス驕碑ウ雁キス貂ャ 111001011010010110101010111001011000110110011000111001011011000010001010111010001011111010111111111001111011011010011010111001011011000110011110111001011011011110111101111010011000000110010100111010001011001110001010111001011011011110111101111001101011100010101100 e5a5aae58d98e5b08ae8bebfe7b69ae5b19ee5b7bde98194e8b38ae5b7bde6b8ac
EUC-JP 螂ェ蜊伜ー願セソ邯壼ア槫キス驕碑ウ雁キス貂ャ 1110101010100111100011101010101011101001111011011101000011100111100011101011000010110100111010101000111010111110100011101011111111101110101110001101010011100111100011101011000111011100111001111000111010110111100011101011110111110001111000011100100011101010100011101011001110110100111001111000111010110111100011101011110111101100101110101000111010101100 eaa78eaae9edd0e78eb0b4ea8ebe8ebfeeb8d4e78eb1dce78eb78ebdf1e1c8ea8eb3b4e78eb78ebdecba8eac
UTF-8 螂ェ蜊伜ー願セソ邯壼ア槫キス驕碑ウ雁キス貂ャ 111010001001111010000010111011111011110110101010111010001001110010001010111001001011110010011100111011111011110110110000111010011010000110011000111011111011110110111110111011111011110110111111111010011000001010101111111001011010001110111100111011111011110110110001111001101010011110101011111011111011110110110111111011111011110110111101111010011010100110010101111001111010001010010001111011111011110110110011111010011001101110000001111011111011110110110111111011111011110110111101111010001011001010000010111011111011110110101100 e89e82efbdaae89c8ae4bc9cefbdb0e9a198efbdbeefbdbfe982afe5a3bcefbdb1e6a7abefbdb7efbdbde9a995e7a291efbdb3e99b81efbdb7efbdbde8b282efbdac
UHC 螂????願??邯?????驕碑?雁??貂? 1101010111001100001111110011111100111111001111111110101011000011001111110011111111001010111110110011111100111111001111110011111100111111110011101111011011011101111110000011111111100100110100100011111100111111111101011011000000111111 d5cc3f3f3f3feac33f3fcafb3f3f3f3f3fcef6ddf83fe4d23f3ff5b03f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.