To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳??孺??儒??孃る???Ⅷ膺??堊 100010100111100000111111001111111001101101111101001111110011111110001110111100100011111100111111100110110110111110000010111010010011111100111111001111111000011101011011111001000101111000111111001111111001101010111111 8a783f3f9b7d3f3f8ef23f3f9b6f82e93f3f3f875be45e3f3f9abf
EUC-JP 岳??孺??儒?Œ孃る????膺??堊 10110011110110010011111100111111110101011101111000111111001111111011110011110100001111111000111110101001101011011101010111010000101001001110101100111111001111110011111100111111111001111011111100111111001111111101010011000001 b3d93f3fd5de3f3fbcf43f8fa9add5d0a4eb3f3f3f3fe7bf3f3fd4c1
UTF-8 岳묒빘孺욤짆儒룹Œ孃る끏吏롳Ⅷ膺꾨옖堊 1110010110110010101100111110101110101100100100101110101110111001100110001110010110101101101110101110110010011010101001001110110010100111100001101110010110000100100100101110101110100011101110011100010110010010111001011010110110000011111000111000001010001011111010111000000110001111111011111010011110011110111010111010000110110011111000101000010110100111111010001000011010111010111010101011111010101000111011001001100010010110111001011010000010001010 e5b2b3ebac92ebb998e5adbaec9aa4eca786e58492eba3b9c592e5ad83e3828beb818fefa79eeba1b3e285a7e886baeabea8ec9896e5a08a
UHC 岳묒빘孺욤짆儒룹Œ孃る끏吏롳Ⅷ膺꾨옖堊 1110010010111111100100011110110010010101101110011110101011101000101111111110100010100011100101011110101011100011101101111110110010101000101010111110010110111110101010101110101110000101101111111110110010100111100011101110111110100101101101111110101111101100100001001110101110011110100111001110010010111110 e4bf91ec95b9eae8bfe8a395eae3b7eca8abe5beaaeb85bfeca78eefa5b7ebec84eb9e9ce4be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)