What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??毅??宥?┛?ル?猷??矣??? 1110000110011111001111110011111110001011010000100011111100111111100101110100011100111111100001001010111000111111100000111000101100111111100101110101000100111111001111111110000111100001001111110011111100111111 e19f3f3f8b423f3f97473f84ae3f838b3f97513f3fe1e13f3f3f
EUC-JP 癲??毅??宥?┛?ル?猷??矣??獒 11100010101000010011111100111111101101011010001100111111001111111100110110101000001111111010100010110000001111111010010111101011001111111100110110110010001111110011111111100010111000110011111100111111100011111100101110111011 e2a13f3fb5a33f3fcda83fa8b03fa5eb3fcdb23f3fe2e33f3f8fcbbb
UTF-8 癲삳끃毅볞뭄宥븍┛曆ル봾猷녷첀矣곕샷獒 111001111001100110110010111011001000001010110011111010111000000110000011111001101010111110000101111010111011001110011110111010111010110110000100111001011010111010100101111010111011100010001101111000101001010010011011111011111010011010001011111000111000001110101011111010111011010010111110111001111000110010110111111010111000010110110111111011001011001010000000111001111001111110100011111010101011001110010101111011001000001110110111111001111000110110010010 e799b2ec82b3eb8183e6af85ebb39eebad84e5aea5ebb88de2949befa68be383abebb4bee78cb7eb85b7ecb280e79fa3eab395ec83b7e78d92
UHC 癲삳끃毅볞뭄宥븍┛曆ル봾猷녷첀矣곕샷獒 1110111110100110101110111110101110000101101110011110101111110110100100111110010010111001101100111110101011101001101110101110101110100110101100001110011010110111101010111110101110010100100001011110101110100011100001101110011010101010100011011110101111111000101100001110101110111100101001101110100010100011 efa6bbeb85b9ebf693e4b9b3eae9baeba6b0e6b7abeb9485eba386e6aa8debf8b0ebbca6e8a3

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
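
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation (which is not shown here). The codec names in `CHARSETS` are assumptions about which Python codecs correspond to the charset labels above (for example `cp932` for SJIS-WIN and `cp949` for UHC), and `to_bit_strings` is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly 8 binary digits, concatenated without spaces.
    binary = "".join(format(byte, "08b") for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ"  # example input; any short string works
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

The binary column is simply the hexadecimal column written out at 8 bits per byte, so the two columns always describe the same byte sequence.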