To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 汁???遠????沮????遠????應?B 100011110110000000111111001111110011111110001001100100110011111100111111001111110011111110011111100111000011111100111111001111110011111110001001100100110011111100111111001111110011111110011100111001000011111101000010 8f603f3f3f89933f3f3f3f9f9c3f3f3f3f89933f3f3f3f9ce43f42
EUC-JP 汁???遠????沮????遠????應?B 101111011100000100111111001111110011111110110001111100110011111100111111001111110011111111011101111111000011111100111111001111110011111110110001111100110011111100111111001111110011111111011000111001100011111101000010 bdc13f3f3fb1f33f3f3f3fddfc3f3f3f3fb1f33f3f3f3fd8e63f42
UTF-8 汁흗렓렜遠펭웃渽렜沮쾰웃渽렜遠펭웃渽렜應렱B 11100110101100011000000111101101100111011001011111101011101000001001001111101011101000001001110011101001100000011010000011101101100011101010110111101100100110111000001111100110101110001011110111101011101000001001110011100110101100101010111011101100101111101011000011101100100110111000001111100110101110001011110111101011101000001001110011101001100000011010000011101101100011101010110111101100100110111000001111100110101110001011110111101011101000001001110011100110100001111000100111101011101000001011000101000010 e6b181ed9d97eba093eba09ce981a0ed8eadec9b83e6b8bdeba09ce6b2aeecbeb0ec9b83e6b8bdeba09ce981a0ed8eadec9b83e6b8bdeba09ce68789eba0b142
UHC 汁흗렓렜遠펭웃渽렜沮쾰웃渽렜遠펭웃渽렜應렱B 11110001111100001100100011101001100011101010100010001110101011101110101011000000110001101110101110111111111101001110111010101010100011101010111011101110110000011100010011101011101111111111010011101110101010101000111010101110111010101100000011000110111010111011111111110100111011101010101010001110101011101110101111101011100011101011111001000010 f1f0c8e98ea88eaeeac0c6ebbff4eeaa8eaeeec1c4ebbff4eeaa8eaeeac0c6ebbff4eeaa8eaeebeb8ebe42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)