To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 陋ケ蜆夊峪蜀芽峪蜀厭陋ケ蜆夊峪蜀芽峪蜀閲^ 111010001001101110111001111001011000010010011010111010001001101110111001111001011000011010001001111010001001101110111001111001011000011010001001011111011110100010011011101110011110010110000100100110101110100010011011101110011110010110000110100010011110100010011011101110011110010110000110100010010111101101011110 e89bb9e5849ae89bb9e58689e89bb9e586897de89bb9e5849ae89bb9e58689e89bb9e586897b5e
EUC-JP 陋ケ蜆夊峪蜀芽峪蜀厭陋ケ蜆夊峪蜀芽峪蜀閲^ 1110111111111011100011101011100111101001111001001101010011101010110101101011101111101001111001101011001011101010110101101011101111101001111001101011000111011110111011111111101110001110101110011110100111100100110101001110101011010110101110111110100111100110101100101110101011010110101110111110100111100110101100011101110001011110 effb8eb9e9e4d4ead6bbe9e6b2ead6bbe9e6b1deeffb8eb9e9e4d4ead6bbe9e6b2ead6bbe9e6b1dc5e
UTF-8 陋ケ蜆夊峪蜀芽峪蜀厭陋ケ蜆夊峪蜀芽峪蜀閲^ 11101001100110011000101111101111101111011011100111101000100111001000011011100101101001001000101011100101101100111010101011101000100111001000000011101000100010101011110111100101101100111010101011101000100111001000000011100101100011101010110111101001100110011000101111101111101111011011100111101000100111001000011011100101101001001000101011100101101100111010101011101000100111001000000011101000100010101011110111100101101100111010101011101000100111001000000011101001100101101011001001011110 e9998befbdb9e89c86e5a48ae5b3aae89c80e88abde5b3aae89c80e58eade9998befbdb9e89c86e5a48ae5b3aae89c80e88abde5b3aae89c80e996b25e
UHC 陋????蜀芽?蜀厭陋????蜀芽?蜀?^ 110101111011000000111111001111110011111100111111111101011011100111100100101101000011111111110101101110011110011011110100110101111011000000111111001111110011111100111111111101011011100111100100101101000011111111110101101110010011111101011110 d7b03f3f3f3ff5b9e4b43ff5b9e6f4d7b03f3f3f3ff5b9e4b43ff5b93f5e
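
The conversion in the table above can be approximated offline. The following is a minimal sketch, not the tool's actual implementation. It assumes Python's standard codecs are acceptable stand-ins for the listed charsets (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and that unencodable characters are replaced with `?` (0x3f), which would be consistent with the ISO-8859-1 row. The names `to_bit_strings` and `CHARSETS` are hypothetical.

```python
def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # Assumption: characters the codec cannot represent become "?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for name, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("^", codec)
        print(name, binary, hexadecimal)
```

For the ASCII character `^`, every charset in the mapping yields the single byte 5e (binary 01011110), which matches the final byte of each row in the table.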

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
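
As a small illustration of how these two Japanese charsets differ, the sketch below encodes one hiragana character with each. It assumes Python's `cp932` and `euc_jp` codecs correspond to SJIS-Win and EUC-JP; the variable names are hypothetical.

```python
text = "\u3042"  # HIRAGANA LETTER A

sjis_bytes = text.encode("cp932")   # assumed stand-in for SJIS-Win (CP932)
euc_bytes = text.encode("euc_jp")   # assumed stand-in for EUC-JP

print(sjis_bytes.hex())  # 82a0
print(euc_bytes.hex())   # a4a2

# Decoding with the matching codec restores the original character.
round_trip_ok = (
    sjis_bytes.decode("cp932") == text and euc_bytes.decode("euc_jp") == text
)
print(round_trip_ok)  # True
```

The same character maps to different byte sequences in the two charsets, so bytes written in one and read as the other will not decode to the original text.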