To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????nR????n^[????nR????n^[^ 001111110011111100111111001111110110111001010010001111110011111100111111001111110110111001011110010110110011111100111111001111110011111101101110010100100011111100111111001111110011111101101110010111100101101101011110 3f3f3f3f6e523f3f3f3f6e5e5b3f3f3f3f6e523f3f3f3f6e5e5b5e
SJIS-WIN 處?處?nR處?處?n^[處?處?nR處?處?n^[^ 1001100101111100001111111001100101111100001111110110111001010010100110010111110000111111100110010111110000111111011011100101111001011011100110010111110000111111100110010111110000111111011011100101001010011001011111000011111110011001011111000011111101101110010111100101101101011110 997c3f997c3f6e52997c3f997c3f6e5e5b997c3f997c3f6e52997c3f997c3f6e5e5b5e
EUC-JP 處?處?nR處?處?n^[處?處?nR處?處?n^[^ 1101000111011101001111111101000111011101001111110110111001010010110100011101110100111111110100011101110100111111011011100101111001011011110100011101110100111111110100011101110100111111011011100101001011010001110111010011111111010001110111010011111101101110010111100101101101011110 d1dd3fd1dd3f6e52d1dd3fd1dd3f6e5e5bd1dd3fd1dd3f6e52d1dd3fd1dd3f6e5e5b5e
UTF-8 處셈處셈nR處셈處셈n^[處셈處셈nR處셈處셈n^[^ 1110100010011001100101011110110010000101100010001110100010011001100101011110110010000101100010000110111001010010111010001001100110010101111011001000010110001000111010001001100110010101111011001000010110001000011011100101111001011011111010001001100110010101111011001000010110001000111010001001100110010101111011001000010110001000011011100101001011101000100110011001010111101100100001011000100011101000100110011001010111101100100001011000100001101110010111100101101101011110 e89995ec8588e89995ec85886e52e89995ec8588e89995ec85886e5e5be89995ec8588e89995ec85886e52e89995ec8588e89995ec85886e5e5b5e
UHC 處셈處셈nR處셈處셈n^[處셈處셈nR處셈處셈n^[^ 11110100101001011011110011000000111101001010010110111100110000000110111001010010111101001010010110111100110000001111010010100101101111001100000001101110010111100101101111110100101001011011110011000000111101001010010110111100110000000110111001010010111101001010010110111100110000001111010010100101101111001100000001101110010111100101101101011110 f4a5bcc0f4a5bcc06e52f4a5bcc0f4a5bcc06e5e5bf4a5bcc0f4a5bcc06e52f4a5bcc0f4a5bcc06e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)