To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 箏簑鮏簔矣箏モタ 11100010101101011110001011000000111111000100001111100010110000011110000111100001111101011011110111100010101101011101001111000000 e2b5e2c0fc43e2c1e1e1f5bde2b5d3c0
EUC-JP 箏簑鮏簔矣?箏モタ 111001001011011111100100110000101000111111101010110110111110010011000011111000101110001100111111111001001011011110001110110100111000111011000000 e4b7e4c28feadbe4c3e2e33fe4b78ed38ec0
UTF-8 箏簑鮏簔矣箏モタ 111001111010111010001111111001111011000010010001111010011010111010001111111001111011000010010100111001111001111110100011111011101001000010101000111001111010111010001111111011111011111010010011111011111011111010000000 e7ae8fe7b091e9ae8fe7b094e79fa3ee90a8e7ae8fefbe93efbe80
UHC 箏???矣?箏?? 111011101011010000111111001111110011111111101011111110000011111111101110101101000011111100111111 eeb43f3f3febf83feeb43f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)