To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B?????????B^ 001111110011111100111111001111110011111100111111001111110011111100111111010000100011111100111111001111110011111100111111001111110011111100111111001111110100001001011110 3f3f3f3f3f3f3f3f3f423f3f3f3f3f3f3f3f3f425e
SJIS-WIN 癲??猶?????B癲??猶?????B^ 11100001100111110011111100111111100101110101000000111111001111110011111100111111001111110100001011100001100111110011111100111111100101110101000000111111001111110011111100111111001111110100001001011110 e19f3f3f97503f3f3f3f3f42e19f3f3f97503f3f3f3f3f425e
EUC-JP 癲??猶?????B癲??猶?????B^ 11100010101000010011111100111111110011011011000100111111001111110011111100111111001111110100001011100010101000010011111100111111110011011011000100111111001111110011111100111111001111110100001001011110 e2a13f3fcdb13f3f3f3f3f42e2a13f3fcdb13f3f3f3f3f425e
UTF-8 癲뺣슦猶쒐땸流곷쥥B癲뺣슦猶쒐땸流곷쥥B^ 111001111001100110110010111010111011101010100011111011001000101010100110111001111000110010110110111011001001001010010000111010111001010110111000111011111010011110001010111010101011001110110111111011001010010110100101010000101110011110011001101100101110101110111010101000111110110010001010101001101110011110001100101101101110110010010010100100001110101110010101101110001110111110100111100010101110101010110011101101111110110010100101101001010100001001011110 e799b2ebbaa3ec8aa6e78cb6ec9290eb95b8efa78aeab3b7eca5a542e799b2ebbaa3ec8aa6e78cb6ec9290eb95b8efa78aeab3b7eca5a5425e
UHC 癲뺣슦猶쒐땸流곷쥥B癲뺣슦猶쒐땸流곷쥥B^ 111011111010011010010101111010111001101010110000111010111010001010011100111001111000101110001110111010101111110010000001111010111010001010010111010000101110111110100110100101011110101110011010101100001110101110100010100111001110011110001011100011101110101011111100100000011110101110100010100101110100001001011110 efa695eb9ab0eba29ce78b8eeafc81eba29742efa695eb9ab0eba29ce78b8eeafc81eba297425e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)