What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀???筌??誼∽?溢??矜苑??源??癲 100010001010001100111111001111110011111111100010101000110011111100111111100010110110001010000001111001000011111110001000111011000011111100111111111000011110000010001001100100010011111100111111100011001011100100111111001111111110000110011111 88a33f3f3fe2a33f3f8b6281e43f88ec3f3fe1e089913f3f8cb93f3fe19f
EUC-JP 哀???筌??誼∽?溢??矜苑??源??癲 101100001010010100111111001111110011111111100100101001010011111100111111101101011100001110100010111001100011111110110000111011100011111100111111111000101110001010110001111100010011111100111111101110001011101100111111001111111110001010100001 b0a53f3f3fe4a53f3fb5c3a2e63fb0ee3f3fe2e2b1f13f3fb8bb3f3fe2a1
UTF-8 哀읪딄퐥筌뗪퉭誼∽쫫溢껃맅矜苑롳쭓源낆젩癲 111001011001001110000000111011001001110110101010111010111001010010000100111011011001000010100101111001111010110110001100111010111001011110101010111011011000100110101101111010001010101010111100111000101000100010111101111011001010101110101011111001101011101010100010111010101011101110000011111010111010011110000101111001111001111110011100111010001000101110010001111010111010000110110011111011001010110110010011111001101011101010010000111010111000001010000110111011001010000010101001111001111001100110110010 e59380ec9daaeb9484ed90a5e7ad8ceb97aaed89ade8aabce288bdecababe6baa2eabb83eba785e79f9ce88b91eba1b3ecad93e6ba90eb8286eca0a9e799b2
UHC 哀읪딄퐥筌뗪퉭誼∽쫫溢껃맅矜苑롳쭓源낆젩癲 111001001110111010011111110100011000101011101010101111011000111011101111101001111000101111101010101110011000010111101011111111101010000111101111101001101000010011101100111011101000001111100101100100001001111111010000111010001110101010111101100011101110111110100111100010111110101010111001100001011110110010100000101000011110111110100110 e4ee9fd18aeabd8eefa78beab985ebfea1efa684ecee83e5909fd0e8eabd8eefa78beab985eca0a1efa6

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
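
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation: the helper name `encode_table` is hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the table's labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("哀").items():
        print(label, bits, hex_string)
```

For the single character 哀, this yields `e59380` in UTF-8, `88a3` in SJIS-WIN, and `b0a5` in EUC-JP, which match the leading bytes of the corresponding rows above.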