To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???靭??膺??如??????意℡?鷹?? 00111111001111110011111110010000011110000011111100111111111001000101111000111111001111111001010001000000001111110011111100111111001111110011111100111111100010001101001110000111100001000011111110010001111010010011111100111111 3f3f3f90783f3fe45e3f3f94403f3f3f3f3f3f88d387843f91e93f3f
EUC-JP ???靭??膺??如??????意??鷹?? 001111110011111100111111101111111101100100111111001111111110011110111111001111110011111111000111101000010011111100111111001111110011111100111111001111111011000011010101001111110011111111000010111010110011111100111111 3f3f3fbfd93f3fe7bf3f3fc7a13f3f3f3f3f3fb0d53f3fc2eb3f3f
UTF-8 麗몃쓷靭듿칰膺우뒇如붞븐퓛蓮곥룗意℡칰鷹곴석 111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001001110111111111011001011100110110000111010001000011010111010111011001001101010110000111010111001001010000111111001011010011010000010111010111011011010011110111010111011100010010000111011011001001110011011111011111010011010011001111010101011001110100101111010111010001110010111111001101000010010001111111000101000010010100001111011001011100110110000111010011011011110111001111010101011001110110100111011001000010010011101 efa688ebaa83ec93b7e99dadeb93bfecb9b0e886baec9ab0eb9287e5a682ebb69eebb890ed939befa699eab3a5eba397e6848fe284a1ecb9b0e9b7b9eab3b4ec849d
UHC 麗몃쓷靭듿칰膺우뒇如붞븐퓛蓮곥룗意℡칰鷹곴석 1110011010110000101110001110101110011101100101001110110011100101100010101110010110101111100000111110101111101100101111111110110010001010100001011110010111111101100101001100111010111010111011001011111110000110111001101110010110000001111000111000111110010011111010111111001010100010111001011010111110000011111010111110110110000001111010101011110010101110 e6b0b8eb9d94ece58ae5af83ebecbfec8a85e5fd94cebaecbf86e6e581e38f93ebf2a2e5af83ebed81eabcae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)