To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 貉ソ逍セ髴呎ケソ邵柯貉ソ逍セ髴呎ケソ邵 111001101011100110111111111001111001011010111110111010011001110010011001111001101011100110111111111001111011100010011110011010001110011010111001101111111110011110010110101111101110100110011100100110011110011010111001101111111110011110111000 e6b9bfe796bee99c99e6b9bfe7b89e68e6b9bfe796bee99c99e6b9bfe7b8
EUC-JP 貉ソ逍セ髴呎ケソ邵柯貉ソ逍セ髴呎ケソ邵 1110110010111011100011101011111111101101111101101000111010111110111100011111110011010010111010001000111010111001100011101011111111101110101110101101101111001001111011001011101110001110101111111110110111110110100011101011111011110001111111001101001011101000100011101011100110001110101111111110111010111010 ecbb8ebfedf68ebef1fcd2e88eb98ebfeebadbc9ecbb8ebfedf68ebef1fcd2e88eb98ebfeeba
UTF-8 貉ソ逍セ髴呎ケソ邵柯貉ソ逍セ髴呎ケソ邵 111010001011001010001001111011111011110110111111111010011000000010001101111011111011110110111110111010011010101110110100111001011001000110001110111011111011110110111001111011111011110110111111111010011000001010110101111001101001111110101111111010001011001010001001111011111011110110111111111010011000000010001101111011111011110110111110111010011010101110110100111001011001000110001110111011111011110110111001111011111011110110111111111010011000001010110101 e8b289efbdbfe9808defbdbee9abb4e5918eefbdb9efbdbfe982b5e69fafe8b289efbdbfe9808defbdbee9abb4e5918eefbdb9efbdbfe982b5
UHC ??逍?????邵柯??逍?????邵 001111110011111111100001110011100011111100111111001111110011111100111111111000011101000011001010101011110011111100111111111000011100111000111111001111110011111100111111001111111110000111010000 3f3fe1ce3f3f3f3f3fe1d0caaf3f3fe1ce3f3f3f3f3fe1d0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)