What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 省ア湿蔀硝鮠ョ}v省ア湿蔀硝鮠ョ}vB 100011111100100010110001100011101011110011110010101100101000111011000001100011111100100111101001101111001010111001111101011101101000111111001000101100011000111010111100111100101011001010001110110000011000111111001001111010011011110010101110011111010111011001000010 8fc8b18ebcf2b28ec18fc9e9bcae7d768fc8b18ebcf2b28ec18fc9e9bcae7d7642
EUC-JP 省ア湿?蔀硝鮠ョ}v省ア湿?蔀硝鮠ョ}vB 1011111011001010100011101011000110111100101111100011111110111100110000111011111011001011111100101011111010001110101011100111110101110110101111101100101010001110101100011011110010111110001111111011110011000011101111101100101111110010101111101000111010101110011111010111011001000010 beca8eb1bcbe3fbcc3becbf2be8eae7d76beca8eb1bcbe3fbcc3becbf2be8eae7d7642
UTF-8 省ア湿蔀硝鮠ョ}v省ア湿蔀硝鮠ョ}vB 1110011110011100100000011110111110111101101100011110011010111001101111111110111010000111101010011110100010010100100000001110011110100001100111011110100110101110101000001110111110111101101011100111110101110110111001111001110010000001111011111011110110110001111001101011100110111111111011101000011110101001111010001001010010000000111001111010000110011101111010011010111010100000111011111011110110101110011111010111011001000010 e79c81efbdb1e6b9bfee87a9e89480e7a19de9aea0efbdae7d76e79c81efbdb1e6b9bfee87a9e89480e7a19de9aea0efbdae7d7642
UHC 省????硝??}v省????硝??}vB 11100000111111010011111100111111001111110011111111110101101001100011111100111111011111010111011011100000111111010011111100111111001111110011111111110101101001100011111100111111011111010111011001000010 e0fd3f3f3f3ff5a63f3f7d76e0fd3f3f3f3ff5a63f3f7d7642

SJIS-WIN, EUC-JP: legacy charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A "?" (hex 3f) in a row marks a character that the charset cannot represent.
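
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the function name encode_in_charsets is made up for illustration. Unencodable characters are replaced with "?", which appears to match the 3f bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text."""
    result = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for characters
        # that the target charset cannot represent.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        result[label] = (bits, raw.hex())
    return result


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_in_charsets("省").items():
        print(label, bits, hexstr)
```

For the single character 省, this should reproduce the leading bytes of the corresponding table rows, such as e79c81 for UTF-8 and 8fc8 for SJIS-WIN. Edge cases may differ from the tool, because codec tables for vendor extensions are not identical across implementations.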