To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????nR????n^[????nR????n^[^ 001111110011111100111111001111110110111001010010001111110011111100111111001111110110111001011110010110110011111100111111001111110011111101101110010100100011111100111111001111110011111101101110010111100101101101011110 3f3f3f3f6e523f3f3f3f6e5e5b3f3f3f3f6e523f3f3f3f6e5e5b5e
SJIS-WIN 臟???nR臟???n^[臟???nR臟???n^[^ 11100100011001100011111100111111001111110110111001010010111001000110011000111111001111110011111101101110010111100101101111100100011001100011111100111111001111110110111001010010111001000110011000111111001111110011111101101110010111100101101101011110 e4663f3f3f6e52e4663f3f3f6e5e5be4663f3f3f6e52e4663f3f3f6e5e5b5e
EUC-JP 臟??駙nR臟??駙n^[臟??駙nR臟??駙n^[^ 111001111100011100111111001111111000111111101001101010010110111001010010111001111100011100111111001111111000111111101001101010010110111001011110010110111110011111000111001111110011111110001111111010011010100101101110010100101110011111000111001111110011111110001111111010011010100101101110010111100101101101011110 e7c73f3f8fe9a96e52e7c73f3f8fe9a96e5e5be7c73f3f8fe9a96e52e7c73f3f8fe9a96e5e5b5e
UTF-8 臟펠렫駙nR臟펠렫駙n^[臟펠렫駙nR臟펠렫駙n^[^ 1110100010000111100111111110110110001110101000001110101110100000101010111110100110100111100110010110111001010010111010001000011110011111111011011000111010100000111010111010000010101011111010011010011110011001011011100101111001011011111010001000011110011111111011011000111010100000111010111010000010101011111010011010011110011001011011100101001011101000100001111001111111101101100011101010000011101011101000001010101111101001101001111001100101101110010111100101101101011110 e8879fed8ea0eba0abe9a7996e52e8879fed8ea0eba0abe9a7996e5e5be8879fed8ea0eba0abe9a7996e52e8879fed8ea0eba0abe9a7996e5e5b5e
UHC 臟펠렫駙nR臟펠렫駙n^[臟펠렫駙nR臟펠렫駙n^[^ 11101101111101001100011011100111100011101011100111011101101111110110111001010010111011011111010011000110111001111000111010111001110111011011111101101110010111100101101111101101111101001100011011100111100011101011100111011101101111110110111001010010111011011111010011000110111001111000111010111001110111011011111101101110010111100101101101011110 edf4c6e78eb9ddbf6e52edf4c6e78eb9ddbf6e5e5bedf4c6e78eb9ddbf6e52edf4c6e78eb9ddbf6e5e5b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
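
A conversion like the one in the table above can be approximated with a short Python sketch. This is not the code behind this page. The codec names are assumptions about which Python codecs correspond to each row (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC), and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's row labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset; keys are the row labels."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


# Print one row per charset for a sample input.
for label, (bits, hexstr) in encode_table("あ").items():
    print(label, bits, hexstr)
```

Each charset may produce a different number of bytes for the same character, which is why the bit strings in the table differ in length from row to row.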