To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?B?Bf?B?B^}Y?B?Bf?B?B^}bE 00111111010000100011111101000010011001100011111101000010001111110100001001011110011111010101100100111111010000100011111101000010011001100011111101000010001111110100001001011110011111010110001001000101 3f423f42663f423f425e7d593f423f42663f423f425e7d6245
SJIS-WIN 短B短Bf短B短B^}Y短B短Bf短B短B^}bE 100100100101101001000010100100100101101001000010011001101001001001011010010000101001001001011010010000100101111001111101010110011001001001011010010000101001001001011010010000100110011010010010010110100100001010010010010110100100001001011110011111010110001001000101 925a42925a4266925a42925a425e7d59925a42925a4266925a42925a425e7d6245
EUC-JP 短B短Bf短B短B^}Y短B短Bf短B短B^}bE 110000111011101101000010110000111011101101000010011001101100001110111011010000101100001110111011010000100101111001111101010110011100001110111011010000101100001110111011010000100110011011000011101110110100001011000011101110110100001001011110011111010110001001000101 c3bb42c3bb4266c3bb42c3bb425e7d59c3bb42c3bb4266c3bb42c3bb425e7d6245
UTF-8 短B短Bf短B短B^}Y短B短Bf短B短B^}bE 1110011110011111101011010100001011100111100111111010110101000010011001101110011110011111101011010100001011100111100111111010110101000010010111100111110101011001111001111001111110101101010000101110011110011111101011010100001001100110111001111001111110101101010000101110011110011111101011010100001001011110011111010110001001000101 e79fad42e79fad4266e79fad42e79fad425e7d59e79fad42e79fad4266e79fad42e79fad425e7d6245
UHC 短B短Bf短B短B^}Y短B短Bf短B短B^}bE 110100111010110101000010110100111010110101000010011001101101001110101101010000101101001110101101010000100101111001111101010110011101001110101101010000101101001110101101010000100110011011010011101011010100001011010011101011010100001001011110011111010110001001000101 d3ad42d3ad4266d3ad42d3ad425e7d59d3ad42d3ad4266d3ad42d3ad425e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)