To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 頂頃??逡???烝???頂頃??逡???烝???^ 100100101011100010001101101000000011111100111111111001111001010100111111001111110011111111100000011111100011111100111111001111111001001010111000100011011010000000111111001111111110011110010101001111110011111100111111111000000111111000111111001111110011111101011110 92b88da03f3fe7953f3f3fe07e3f3f3f92b88da03f3fe7953f3f3fe07e3f3f3f5e
EUC-JP 頂頃??逡???烝???頂頃??逡???烝???^ 110001001011101010111010101000100011111100111111111011011111010100111111001111110011111111011111110111110011111100111111001111111100010010111010101110101010001000111111001111111110110111110101001111110011111100111111110111111101111100111111001111110011111101011110 c4babaa23f3fedf53f3f3fdfdf3f3f3fc4babaa23f3fedf53f3f3fdfdf3f3f3f5e
UTF-8 頂頃렰렒逡肋렰렑烝렓梨렢頂頃렰렒逡肋렰렑烝렓梨렗^ 11101001101000001000001011101001101000001000001111101011101000001011000011101011101000001001001011101001100000001010000111101111101001011001001111101011101000001011000011101011101000001001000111100111100000111001110111101011101000001001001111101111101001111010001011101011101000001010001011101001101000001000001011101001101000001000001111101011101000001011000011101011101000001001001011101001100000001010000111101111101001011001001111101011101000001011000011101011101000001001000111100111100000111001110111101011101000001001001111101111101001111010001011101011101000001001011101011110 e9a082e9a083eba0b0eba092e980a1efa593eba0b0eba091e7839deba093efa7a2eba0a2e9a082e9a083eba0b0eba092e980a1efa593eba0b0eba091e7839deba093efa7a2eba0975e
UHC 頂頃렰렒逡肋렰렑烝렓梨렢頂頃렰렒逡肋렰렑烝렓梨렗^ 11110000101000101100110011110001100011101011110110001110101001111111000111100100110100101111000110001110101111011000111010100110111100011111011010001110101010001110110010110001100011101011001111110000101000101100110011110001100011101011110110001110101001111111000111100100110100101111000110001110101111011000111010100110111100011111011010001110101010001110110010110001100011101010110001011110 f0a2ccf18ebd8ea7f1e4d2f18ebd8ea6f1f68ea8ecb18eb3f0a2ccf18ebd8ea7f1e4d2f18ebd8ea6f1f68ea8ecb18eac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)