What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN チ「称ケタ袱ネタ蠡蛩チ「称ケタ袱ネタ蠡蛔^ 1100000110100010100011111100110010111001110000001110010111011110110010001100000011100101110000001110010101111101110000011010001010001111110011001011100111000000111001011101111011001000110000001110010111000000111001010111101101011110 c1a28fccb9c0e5dec8c0e5c0e57dc1a28fccb9c0e5dec8c0e5c0e57b5e
EUC-JP チ「称ケタ袱ネタ蠡蛩チ「称ケタ袱ネタ蠡蛔^ 1000111011000001100011101010001010111110110011101000111010111001100011101100000011101010111000001000111011001000100011101100000011101010110000101110100111011110100011101100000110001110101000101011111011001110100011101011100110001110110000001110101011100000100011101100100010001110110000001110101011000010111010011101110001011110 8ec18ea2bece8eb98ec0eae08ec88ec0eac2e9de8ec18ea2bece8eb98ec0eae08ec88ec0eac2e9dc5e
UTF-8 チ「称ケタ袱ネタ蠡蛩チ「称ケタ袱ネタ蠡蛔^ 11101111101111101000000111101111101111011010001011100111101001111011000011101111101111011011100111101111101111101000000011101000101000101011000111101111101111101000100011101111101111101000000011101000101000001010000111101000100110111010100111101111101111101000000111101111101111011010001011100111101001111011000011101111101111011011100111101111101111101000000011101000101000101011000111101111101111101000100011101111101111101000000011101000101000001010000111101000100110111001010001011110 efbe81efbda2e7a7b0efbdb9efbe80e8a2b1efbe88efbe80e8a0a1e89ba9efbe81efbda2e7a7b0efbdb9efbe80e8a2b1efbe88efbe80e8a0a1e89b945e
UHC ???????????????????蛔^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111111111001110111001011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3ffcee5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.