To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 円??節ゆ?檣??円??節??薔??円 100010010111111000111111001111111001000011011111100000101110010000111111100111101111110000111111001111111000100101111110001111110011111110010000110111110011111100111111111001010100101100111111001111111000100101111110 897e3f3f90df82e43f9efc3f3f897e3f3f90df3f3fe54b3f3f897e
EUC-JP 円??節ゆ?檣??円??節??薔??円 101100011101111100111111001111111100000011100001101001001110011000111111110111001111111000111111001111111011000111011111001111110011111111000000111000010011111100111111111010011010110000111111001111111011000111011111 b1df3f3fc0e1a4e63fdcfe3f3fb1df3f3fc0e13f3fe9ac3f3fb1df
UTF-8 円띨떉節ゆ쇂檣쏙쉬円띨쉵節⒵쇂薔⑼쉠円 111001011000011010000110111010111001110110101000111010111001011010001001111001111010111110000000111000111000001010000110111011001000011110000010111001101010101010100011111011001000111110011001111011001000100110101100111001011000011010000110111010111001110110101000111011001000100110110101111001111010111110000000111000101001001010110101111011001000011110000010111010001001011010010100111000101001000110111100111011001000100110100000111001011000011010000110 e58686eb9da8eb9689e7af80e38286ec8782e6aaa3ec8f99ec89ace58686eb9da8ec89b5e7af80e292b5ec8782e89694e291bcec89a0e58686
UHC 円띨떉節ゆ쇂檣쏙쉬円띨쉵節⒵쇂薔⑼쉠円 1110010111110111101101101110111010001011100111111110111110111101101010101110011010011001101101101110110111101010101111011110111110111101101011001110010111110111101101101110111010011010100010111110111110111101101010011110011010011001101101101110110111111001101010011110111110111101101010101110010111110111 e5f7b6ee8b9fefbdaae699b6edeabdefbdace5f7b6ee9a8befbda9e699b6edf9a9efbdaae5f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)