To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???揖ヤ??щ?嚴щ?徇??乙??魚 001111110011111100111111100101110100101110000011100001000011111100111111100001001000101100111111100110101000111010000100100010110011111110011100011011010011111100111111100010011011001100111111001111111000101110011011 3f3f3f974b83843f3f848b3f9a8e848b3f9c6d3f3f89b33f3f8b9b
EUC-JP ???揖ヤ?蓀щ?嚴щ?徇??乙??魚 0011111100111111001111111100110110101100101001011110010000111111100011111101100011111000101001111110101100111111110100111110111010100111111010110011111111010111110011100011111100111111101100101011010100111111001111111011010111111011 3f3f3fcdaca5e43f8fd8f8a7eb3fd3eea7eb3fd7ce3f3fb2b53f3fb5fb
UTF-8 嶪용뜆揖ヤ펺蓀щ궔嚴щ벡徇끾릸乙논떦魚 11100101101101101010101011101100100110101010100111101011100111001000011011100110100011111001011011100011100000111010010011101101100011101011101011101000100100111000000011010001100010011110101010110110100101001110010110011010101101001101000110001001111010111011001010100001111001011011111010000111111010111000000110111110111010111010011010111000111001001011100110011001111010111000010110111100111010111001011010100110111010011010110110011010 e5b6aaec9aa9eb9c86e68f96e383a4ed8ebae89380d189eab694e59ab4d189ebb2a1e5be87eb81beeba6b8e4b999eb85bceb96a6e9ad9a
UHC 嶪용뜆揖ヤ펺蓀щ궔嚴щ벡徇끾릸乙논떦魚 1110010111110101101111111110101110001101100010011110101111100111101010111110010010111100100010101110000111100000101011001110101110000010101010011110010111110001101011001110101110111010101001001110001011011111100001011110011010010000100101101110101111100000101100111110110110001011101110011110010111100000 e5f5bfeb8d89ebe7abe4bc8ae1e0aceb82a9e5f1acebbaa4e2df85e69096ebe0b3ed8bb9e5e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)