To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
EUC-JP 薏?????沅??[薏?????沅??[^ 1000111111011001110111100011111100111111001111110011111100111111100011111100011011101001001111110011111101011011100011111101100111011110001111110011111100111111001111110011111110001111110001101110100100111111001111110101101101011110 8fd9de3f3f3f3f3f8fc6e93f3f5b8fd9de3f3f3f3f3f8fc6e93f3f5b5e
UTF-8 薏쇨떱療껋씪沅룬쟽[薏쇨떱療껋씪沅룬쟽[^ 111010001001011010001111111011001000011110101000111010111001011010110001111011111010011110000001111010101011101110001011111011001001010010101010111001101011001010000101111010111010001110101100111011001001111110111101010110111110100010010110100011111110110010000111101010001110101110010110101100011110111110100111100000011110101010111011100010111110110010010100101010101110011010110010100001011110101110100011101011001110110010011111101111010101101101011110 e8968fec87a8eb96b1efa781eabb8bec94aae6b285eba3acec9fbd5be8968fec87a8eb96b1efa781eabb8bec94aae6b285eba3acec9fbd5b5e
UHC 薏쇨떱療껋씪沅룬쟽[薏쇨떱療껋씪沅룬쟽[^ 111010111111101110111100111010101011011010110111111010001111111010000011111011001001110110111100111010101011011010110111111010011010000010000011010110111110101111111011101111001110101010110110101101111110100011111110100000111110110010011101101111001110101010110110101101111110100110100000100000110101101101011110 ebfbbceab6b7e8fe83ec9dbceab6b7e9a0835bebfbbceab6b7e8fe83ec9dbceab6b7e9a0835b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)