Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦???筌?????違??腋??伊??筍レ? 1110100111110001001111110011111100111111111000101010001100111111001111110011111100111111001111111000100011100001001111110011111111100011111111000011111100111111100010001100100100111111001111111110001010100001100000111000110000111111 e9f13f3f3fe2a33f3f3f3f3f88e13f3fe3fc3f3f88c93f3fe2a1838c3f
EUC-JP 鴦???筌??彛??違??腋??伊??筍レ? 11110010111100110011111100111111001111111110010010100101001111110011111110001111101111001111101000111111001111111011000011100011001111110011111111100110111111100011111100111111101100001100101100111111001111111110010010100011101001011110110000111111 f2f33f3f3fe4a53f3f8fbcfa3f3fb0e33f3fe6fe3f3fb0cb3f3fe4a3a5ec3f
UTF-8 鴦꾆뀀룱筌덈㉡彛싩솒違곸떱腋잆룂伊볣씣筍レ녇 111010011011010010100110111010101011111010000110111010111000000010000000111010111010001110110001111001111010110110001100111010111000110110001000111000111000100110100001111001011011110110011011111011001000101110101001111011001000011010010010111010011000000110010101111010101011001110111000111010111001011010110001111010001000010110001011111011001001111010000110111010111010001110000010111001001011110010001010111010111011001110100011111011001001010010100011111001111010110110001101111000111000001110101100111010111000010110000111 e9b4a6eabe86eb8080eba3b1e7ad8ceb8d88e389a1e5bd9bec8ba9ec8692e98195eab3b8eb96b1e8858bec9e86eba382e4bc8aebb3a3ec94a3e7ad8de383aceb8587
UHC 鴦꾆뀀룱筌덈㉡彛싩솒違곸떱腋잆룂伊볣씣筍レ녇 1110010011101100100001001100111010110010111010111000111110100110111011111010011110001000111010111010100010110010111011001010110110011010111001111001100110010010111010101101111010000001111011001011011010110111111001001111110110011111111000111000111110000011111011001010010110010011111010011001110110110111111000101110110010101011111011001000011010111110 e4ec84ceb2eb8fa6efa788eba8b2ecad9ae79992eade81ecb6b7e4fd9fe38f83eca593e99db7e2ecabec86be

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
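
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_bits` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for the sample input.
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

For the sample input "あ", the UTF-8 row is `e38182`, the SJIS-WIN row is `82a0`, the EUC-JP row is `a4a2`, and the ISO-8859-1 row falls back to `3f` because the character does not exist in that charset.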