To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?竊?閃??毋?閃??感????秦????B 00111111111000101000011000111111100100010100110100111111001111111001111101111000001111111001000101001101001111110011111110001010101101000011111100111111001111110011111110010000011000000011111100111111001111110011111101000010 3fe2863f914d3f3f9f783f914d3f3f8ab43f3f3f3f90603f3f3f3f42
EUC-JP ?竊?閃??毋?閃??感????秦????B 00111111111000111110011000111111110000011010111000111111001111111101110111011001001111111100000110101110001111110011111110110100101101100011111100111111001111110011111110111111110000010011111100111111001111110011111101000010 3fe3e63fc1ae3f3fddd93fc1ae3f3fb4b63f3f3f3fbfc13f3f3f3f42
UTF-8 뤛竊봅閃퉶凜毋봅閃체쫸感성얀렫롌秦렭얀렻샷B 11101011101001001001101111100111101010111000101011101011101101001000010111101001100101101000001111101101100010011011011011101111101001011001010011100110101011111000101111101011101101001000010111101001100101101000001111101100101100101011010011101100101010111011100011100110100001001001111111101100100001001011000111101100100101101000000011101011101000001010101111101011101000011000110011100111101001111010011011101011101000001010110111101100100101101000000011101011101000001011101111101100100000111011011101000010 eba49be7ab8aebb485e99683ed89b6efa594e6af8bebb485e99683ecb2b4ecabb8e6849fec84b1ec9680eba0abeba18ce7a7a6eba0adec9680eba0bbec83b742
UHC 뤛竊봅閃퉶凜毋봅閃체쫸感성얀렫롌秦렭얀렻샷B 10001111110010101110111110111100101110101011111011100000111011001011100110001110110100101111001011011001111011001011101010111110111000001110110011000011101111001010011010001111110010101110111110111100101110101011111011100001100011101011100110001110110100101111001011011010100011101011101010111110111000011000111011000011101111001010011001000010 8fcaefbcbabee0ecb98ed2f2d9ecbabee0ecc3bca68fcaefbcbabee18eb98ed2f2da8ebabee18ec3bca642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)