To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??幽??艶k????釉??筌?,有 001111110011111100111111111000101000011000111111001111111001011101001000001111110011111110001001100100001000001010001011001111110011111100111111001111111110011111010110001111110011111111100010101000110011111110000001010000111001011101001100 3f3f3fe2863f3f97483f3f8990828b3f3f3f3fe7d63f3fe2a33f8143974c
EUC-JP ???竊??幽??艶k?庾??釉??筌?,有 0011111100111111001111111110001111100110001111110011111111001101101010010011111100111111101100011111000010100011111010110011111110001111101111001100111000111111001111111110111011011000001111110011111111100100101001010011111110100001101001001100110110101101 3f3f3fe3e63f3fcda93f3fb1f0a3eb3f8fbcce3f3feed83f3fe4a53fa1a4cdad
UTF-8 捻뀁뮆竊섉꼷幽껊눀艶k벡庾썲쮦釉먯뒠筌곕,有 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001001111010101011110010110111111001011011100110111101111010101011101110001010111010111000100010000000111010001000100110110110111011111011110110001011111010111011001010100001111001011011101010111110111011001000110110110010111011001010111010100110111010011000011110001001111010111010100010101111111010111001001010100000111001111010110110001100111010101011001110010101111011111011110010001100111001101001110010001001 efa6a4eb8081ebae86e7ab8aec8489eabcb7e5b9bdeabb8aeb8880e889b6efbd8bebb2a1e5babeec8db2ecaea6e98789eba8afeb92a0e7ad8ceab395efbc8ce69c89
UHC 捻뀁뮆竊섉꼷幽껊눀艶k벡庾썲쮦釉먯뒠筌곕,有 1110011011110111101100101110110010010010100101011110111110111100100110001110011010000100100011111110101011101011100000111110101110000111101000011110011011111101101000111110101110111010101001001110101011101100101111011110010110101000100000111110101110111000100100001110110010001010100111001110111110100111101100001110101110100011101011001110101011110011 e6f7b2ec9295efbc98e6848feaeb83eb87a1e6fda3ebbaa4eaecbde5a883ebb890ec8a9cefa7b0eba3aceaf3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)