To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B?????????B^ 001111110011111100111111001111110011111100111111001111110011111100111111010000100011111100111111001111110011111100111111001111110011111100111111001111110100001001011110 3f3f3f3f3f3f3f3f3f423f3f3f3f3f3f3f3f3f425e
SJIS-WIN 癲?????鷹??B癲?????鷹??B^ 11100001100111110011111100111111001111110011111100111111100100011110100100111111001111110100001011100001100111110011111100111111001111110011111100111111100100011110100100111111001111110100001001011110 e19f3f3f3f3f3f91e93f3f42e19f3f3f3f3f3f91e93f3f425e
EUC-JP 癲?????鷹??B癲?????鷹??B^ 11100010101000010011111100111111001111110011111100111111110000101110101100111111001111110100001011100010101000010011111100111111001111110011111100111111110000101110101100111111001111110100001001011110 e2a13f3f3f3f3fc2eb3f3f42e2a13f3f3f3f3fc2eb3f3f425e
UTF-8 癲얜짘栒귛틫鷹숈뒟B癲얜짘栒귛틫鷹숈뒟B^ 111001111001100110110010111011001001011010011100111011001010011110011000111001101010000010010010111010101011011110011011111011011000101110101011111010011011011110111001111011001000100010001000111010111001001010011111010000101110011110011001101100101110110010010110100111001110110010100111100110001110011010100000100100101110101010110111100110111110110110001011101010111110100110110111101110011110110010001000100010001110101110010010100111110100001001011110 e799b2ec969ceca798e6a092eab79bed8babe9b7b9ec8888eb929f42e799b2ec969ceca798e6a092eab79bed8babe9b7b9ec8888eb929f425e
UHC 癲얜짘栒귛틫鷹숈뒟B癲얜짘栒귛틫鷹숈뒟B^ 111011111010011010111110111010111010001110011111111000101110001110000010111001011011101010010101111010111110110110011001111011001000101010011011010000101110111110100110101111101110101110100011100111111110001011100011100000101110010110111010100101011110101111101101100110011110110010001010100110110100001001011110 efa6beeba39fe2e382e5ba95ebed99ec8a9b42efa6beeba39fe2e382e5ba95ebed99ec8a9b425e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)