To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 篠自篠璽篠耳偲室篠耳偲嫉篠治偲ホト耳偲嫉 1000111011000010100011101010100110001110110000101000111010100011100011101100001010001110101010001000111011000011100011101011101010001110110000101000111010101000100011101100001110001110101110011000111011000010100011101010000110001110110000111100111011000100100011101010100010001110110000111000111010111001 8ec28ea98ec28ea38ec28ea88ec38eba8ec28ea88ec38eb98ec28ea18ec3cec48ea88ec38eb9
EUC-JP 篠自篠璽篠耳偲室篠耳偲嫉篠治偲ホト耳偲嫉 10111100110001001011110010101011101111001100010010111100101001011011110011000100101111001010101010111100110001011011110010111100101111001100010010111100101010101011110011000101101111001011101110111100110001001011110010100011101111001100010110001110110011101000111011000100101111001010101010111100110001011011110010111011 bcc4bcabbcc4bca5bcc4bcaabcc5bcbcbcc4bcaabcc5bcbbbcc4bca3bcc58ece8ec4bcaabcc5bcbb
UTF-8 篠自篠璽篠耳偲室篠耳偲嫉篠治偲ホト耳偲嫉 111001111010111110100000111010001000011110101010111001111010111110100000111001111001001010111101111001111010111110100000111010001000000010110011111001011000000110110010111001011010111010100100111001111010111110100000111010001000000010110011111001011000000110110010111001011010101110001001111001111010111110100000111001101011001010111011111001011000000110110010111011111011111010001110111011111011111010000100111010001000000010110011111001011000000110110010111001011010101110001001 e7afa0e887aae7afa0e792bde7afa0e880b3e581b2e5aea4e7afa0e880b3e581b2e5ab89e7afa0e6b2bbe581b2efbe8eefbe84e880b3e581b2e5ab89
UHC 篠自篠璽篠耳?室篠耳?嫉篠治???耳?嫉 11100001110001101110110110111011111000011100011011011111110111101110000111000110111011001011110000111111111000111111100011100001110001101110110010111100001111111111001011101100111000011100011011110110101111010011111100111111001111111110110010111100001111111111001011101100 e1c6edbbe1c6dfdee1c6ecbc3fe3f8e1c6ecbc3ff2ece1c6f6bd3f3f3fecbc3ff2ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)