To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???^nR???^n^[???^nR???^n^[^ 001111110011111100111111010111100110111001010010001111110011111100111111010111100110111001011110010110110011111100111111001111110101111001101110010100100011111100111111001111110101111001101110010111100101101101011110 3f3f3f5e6e523f3f3f5e6e5e5b3f3f3f5e6e523f3f3f5e6e5e5b5e
SJIS-WIN 倭??^nR倭??^n^[倭??^nR倭??^n^[^ 10011000011000000011111100111111010111100110111001010010100110000110000000111111001111110101111001101110010111100101101110011000011000000011111100111111010111100110111001010010100110000110000000111111001111110101111001101110010111100101101101011110 98603f3f5e6e5298603f3f5e6e5e5b98603f3f5e6e5298603f3f5e6e5e5b5e
EUC-JP 倭??^nR倭??^n^[倭??^nR倭??^n^[^ 11001111110000010011111100111111010111100110111001010010110011111100000100111111001111110101111001101110010111100101101111001111110000010011111100111111010111100110111001010010110011111100000100111111001111110101111001101110010111100101101101011110 cfc13f3f5e6e52cfc13f3f5e6e5e5bcfc13f3f5e6e52cfc13f3f5e6e5e5b5e
UTF-8 倭띰슈^nR倭띰슈^n^[倭띰슈^nR倭띰슈^n^[^ 111001011000000010101101111010111001110110110000111011001000101010001000010111100110111001010010111001011000000010101101111010111001110110110000111011001000101010001000010111100110111001011110010110111110010110000000101011011110101110011101101100001110110010001010100010000101111001101110010100101110010110000000101011011110101110011101101100001110110010001010100010000101111001101110010111100101101101011110 e580adeb9db0ec8a885e6e52e580adeb9db0ec8a885e6e5e5be580adeb9db0ec8a885e6e52e580adeb9db0ec8a885e6e5e5b5e
UHC 倭띰슈^nR倭띰슈^n^[倭띰슈^nR倭띰슈^n^[^ 111010001101111010110110111011111011110110110100010111100110111001010010111010001101111010110110111011111011110110110100010111100110111001011110010110111110100011011110101101101110111110111101101101000101111001101110010100101110100011011110101101101110111110111101101101000101111001101110010111100101101101011110 e8deb6efbdb45e6e52e8deb6efbdb45e6e5e5be8deb6efbdb45e6e52e8deb6efbdb45e6e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)