To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN ?鞨?nR?鞨?n^[?鞨?nR?鞨?n^[^ 001111111110100011100000001111110110111001010010001111111110100011100000001111110110111001011110010110110011111111101000111000000011111101101110010100100011111111101000111000000011111101101110010111100101101101011110 3fe8e03f6e523fe8e03f6e5e5b3fe8e03f6e523fe8e03f6e5e5b5e
EUC-JP 塼鞨?nR塼鞨?n^[塼鞨?nR塼鞨?n^[^ 1000111110111000101110011111000011100010001111110110111001010010100011111011100010111001111100001110001000111111011011100101111001011011100011111011100010111001111100001110001000111111011011100101001010001111101110001011100111110000111000100011111101101110010111100101101101011110 8fb8b9f0e23f6e528fb8b9f0e23f6e5e5b8fb8b9f0e23f6e528fb8b9f0e23f6e5e5b5e
UTF-8 塼鞨혁nR塼鞨혁n^[塼鞨혁nR塼鞨혁n^[^ 1110010110100001101111001110100110011110101010001110110110011000100000010110111001010010111001011010000110111100111010011001111010101000111011011001100010000001011011100101111001011011111001011010000110111100111010011001111010101000111011011001100010000001011011100101001011100101101000011011110011101001100111101010100011101101100110001000000101101110010111100101101101011110 e5a1bce99ea8ed98816e52e5a1bce99ea8ed98816e5e5be5a1bce99ea8ed98816e52e5a1bce99ea8ed98816e5e5b5e
UHC 塼鞨혁nR塼鞨혁n^[塼鞨혁nR塼鞨혁n^[^ 1110111011110100110010101110101011000111111101010110111001010010111011101111010011001010111010101100011111110101011011100101111001011011111011101111010011001010111010101100011111110101011011100101001011101110111101001100101011101010110001111111010101101110010111100101101101011110 eef4caeac7f56e52eef4caeac7f56e5e5beef4caeac7f56e52eef4caeac7f56e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)