To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??伊??應??如??揖??臾?????乙? 111010011111001000111111001111111000100011001001001111110011111110011100111001000011111100111111100101000100000000111111001111111001011101001011001111110011111111100100011010110011111100111111001111110011111100111111100010011011001100111111 e9f23f3f88c93f3f9ce43f3f94403f3f974b3f3fe46b3f3f3f3f3f89b33f
EUC-JP 鶯??伊??應??如??揖??臾?????乙? 111100101111010000111111001111111011000011001011001111110011111111011000111001100011111100111111110001111010000100111111001111111100110110101100001111110011111111100111110011000011111100111111001111110011111100111111101100101011010100111111 f2f43f3fb0cb3f3fd8e63f3fc7a13f3fcdac3f3fe7cc3f3f3f3f3fb2b53f
UTF-8 鶯ㅺ퉮伊쒏룚應밸룆如싳뭿揖썽넫臾믩뤊麗몃쓹乙쏣 111010011011011010101111111000111000010110111010111011011000100110101110111001001011110010001010111011001001001010001111111010111010001110011010111001101000011110001001111010111011000010111000111010111010001110000110111001011010011010000010111011001000101110110011111010111010110110111111111001101000111110010110111011001000110110111101111010111000010010101011111010001000011110111110111010111010111110101001111010111010010010001010111011111010011010001000111010111010101010000011111011001001001110111001111001001011100110011001111011001000111110100011 e9b6afe385baed89aee4bc8aec928feba39ae68789ebb0b8eba386e5a682ec8bb3ebadbfe68f96ec8dbdeb84abe887beebafa9eba48aefa688ebaa83ec93b9e4b999ec8fa3
UHC 鶯ㅺ퉮伊쒏룚應밸룆如싳뭿揖썽넫臾믩뤊麗몃쓹乙쏣 11100101101000111010010011101010101110011000011011101100101001011001110011100110100011111001011011101011111010111011100111101011100011111000010111100101111111011001101011101100100100101000111011101011111001111011110111101001100001101010101111101011101011001001001011101011100011111011101011100110101100001011100011101011100111011001010111101011111000001001110001000101 e5a3a4eab986eca59ce68f96ebebb9eb8f85e5fd9aec928eebe7bde986abebac92eb8fbae6b0b8eb9d95ebe09c45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)