To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 犱失鉦豺メ鰊貂鉦戝痔犱失鍾ケメ鰄コ鍾テ痔B 111110110101101110001110101110001000111111011110111001101011011111010010111010011101011111100110101110001000111111011110111001101100000110001110101001001111101101011011100011101011100010001111110111111011100111010010111010011101100010111010100011111101111111000011100011101010010001000010 fb5b8eb88fdee6b7d2e9d7e6b88fdee6c18ea4fb5b8eb88fdfb9d2e9d8ba8fdfc38ea442
EUC-JP 犱失鉦豺メ鰊貂鉦戝痔犱失鍾ケメ鰄コ鍾テ痔B 10001111110010101110111110111100101110101011111011100000111011001011100110001110110100101111001011011001111011001011101010111110111000001110110011000011101111001010011010001111110010101110111110111100101110101011111011100001100011101011100110001110110100101111001011011010100011101011101010111110111000011000111011000011101111001010011001000010 8fcaefbcbabee0ecb98ed2f2d9ecbabee0ecc3bca68fcaefbcbabee18eb98ed2f2da8ebabee18ec3bca642
UTF-8 犱失鉦豺メ鰊貂鉦戝痔犱失鍾ケメ鰄コ鍾テ痔B 11100111100010101011000111100101101001001011000111101001100010011010011011101000101100011011101011101111101111101001001011101001101100001000101011101000101100101000001011101001100010011010011011100110100010001001110111100111100101111001010011100111100010101011000111100101101001001011000111101001100011011011111011101111101111011011100111101111101111101001001011101001101100001000010011101111101111011011101011101001100011011011111011101111101111101000001111100111100101111001010001000010 e78ab1e5a4b1e989a6e8b1baefbe92e9b08ae8b282e989a6e6889de79794e78ab1e5a4b1e98dbeefbdb9efbe92e9b084efbdbae98dbeefbe83e7979442
UHC ?失鉦豺??貂鉦?痔?失鍾????鍾?痔B 00111111111000111111011111101111111110101110001111001111001111110011111111110101101100001110111111111010001111111111011011000000001111111110001111110111111100011010001100111111001111110011111100111111111100011010001100111111111101101100000001000010 3fe3f7effae3cf3f3ff5b0effa3ff6c03fe3f7f1a33f3f3f3ff1a33ff6c042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)