To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蜈?К嵬よ?鸚??蜈?К嵬よ?鸚??^ 1110010110000101001111111000010001001011100110111100101010000010111001100011111111101010010111110011111100111111111001011000010100111111100001000100101110011011110010101000001011100110001111111110101001011111001111110011111101011110 e5853f844b9bca82e63fea5f3f3fe5853f844b9bca82e63fea5f3f3f5e
EUC-JP 蜈?К嵬よ?鸚??蜈?К嵬よ?鸚??^ 1110100111100101001111111010011110101100110101101100110010100100111010000011111111110011110000000011111100111111111010011110010100111111101001111010110011010110110011001010010011101000001111111111001111000000001111110011111101011110 e9e53fa7acd6cca4e83ff3c03f3fe9e53fa7acd6cca4e83ff3c03f3f5e
UTF-8 蜈좂К嵬よ쪧鸚㎪뤃蜈좂К嵬よ쪧鸚㎪뤃^ 1110100010011100100010001110110010100010100000101101000010011010111001011011010110101100111000111000001010001000111011001010101010100111111010011011100010011010111000111000111010101010111010111010010010000011111010001001110010001000111011001010001010000010110100001001101011100101101101011010110011100011100000101000100011101100101010101010011111101001101110001001101011100011100011101010101011101011101001001000001101011110 e89c88eca282d09ae5b5ace38288ecaaa7e9b89ae38eaaeba483e89c88eca282d09ae5b5ace38288ecaaa7e9b89ae38eaaeba4835e
UHC 蜈좂К嵬よ쪧鸚㎪뤃蜈좂К嵬よ쪧鸚㎪뤃^ 11101000101001011010000011100111101011001010110011101000111000111010101011101000101001011010000011100101101001001010011111100110100011111011010011101000101001011010000011100111101011001010110011101000111000111010101011101000101001011010000011100101101001001010011111100110100011111011010001011110 e8a5a0e7acace8e3aae8a5a0e5a4a7e68fb4e8a5a0e7acace8e3aae8a5a0e5a4a7e68fb45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)