To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????m}?????????m{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110110101111101001111110011111100111111001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f3f3f3f3f3f3f6d7d3f3f3f3f3f3f3f3f3f6d7b5e
SJIS-WIN 亦?????鷹??m}亦?????鷹??m{^ 100101101001001000111111001111110011111100111111001111111001000111101001001111110011111101101101011111011001011010010010001111110011111100111111001111110011111110010001111010010011111100111111011011010111101101011110 96923f3f3f3f3f91e93f3f6d7d96923f3f3f3f3f91e93f3f6d7b5e
EUC-JP 亦??瑗??鷹??m}亦??瑗??鷹??m{^ 11001011111100100011111100111111100011111100110011000000001111110011111111000010111010110011111100111111011011010111110111001011111100100011111100111111100011111100110011000000001111110011111111000010111010110011111100111111011011010111101101011110 cbf23f3f8fccc03f3fc2eb3f3f6d7dcbf23f3f8fccc03f3fc2eb3f3f6d7b5e
UTF-8 亦껓퐟瑗뉐틫鷹숈돁m}亦껓퐟瑗뉐틫鷹숈돁m{^ 1110010010111010101001101110101010111011100100111110110110010000100111111110011110010001100101111110101110001001100100001110110110001011101010111110100110110111101110011110110010001000100010001110101110001111100000010110110101111101111001001011101010100110111010101011101110010011111011011001000010011111111001111001000110010111111010111000100110010000111011011000101110101011111010011011011110111001111011001000100010001000111010111000111110000001011011010111101101011110 e4baa6eabb93ed909fe79197eb8990ed8babe9b7b9ec8888eb8f816d7de4baa6eabb93ed909fe79197eb8990ed8babe9b7b9ec8888eb8f816d7b5e
UHC 亦껓퐟瑗뉐틫鷹숈돁m}亦껓퐟瑗뉐틫鷹숈돁m{^ 1110011010110010100000111110111110111101100010001110101010111100100001111110010110111010100101011110101111101101100110011110110010001001100101000110110101111101111001101011001010000011111011111011110110001000111010101011110010000111111001011011101010010101111010111110110110011001111011001000100110010100011011010111101101011110 e6b283efbd88eabc87e5ba95ebed99ec89946d7de6b283efbd88eabc87e5ba95ebed99ec89946d7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)