To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN ??∥???怨??z??∥???怨??zB 00111111001111111000000101100001001111110011111100111111100010011000010100111111001111110111101000111111001111111000000101100001001111110011111100111111100010011000010100111111001111110111101001000010 3f3f81613f3f3f89853f3f7a3f3f81613f3f3f89853f3f7a42
EUC-JP ??‖???怨??z??‖???怨??zB 00111111001111111010000111000010001111110011111100111111101100011110010100111111001111110111101000111111001111111010000111000010001111110011111100111111101100011110010100111111001111110111101001000010 3f3fa1c23f3f3fb1e53f3f7a3f3fa1c23f3f3fb1e53f3f7a42
UTF-8 略노∥柳롳쭓怨뺤졎z略노∥柳롳쭓怨뺤졎zB 111011111010010110110110111010111000010110111000111000101000100010100101111011111010011110001001111010111010000110110011111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110001110011110101110111110100101101101101110101110000101101110001110001010001000101001011110111110100111100010011110101110100001101100111110110010101101100100111110011010000000101010001110101110111010101001001110110010100001100011100111101001000010 efa5b6eb85b8e288a5efa789eba1b3ecad93e680a8ebbaa4eca18e7aefa5b6eb85b8e288a5efa789eba1b3ecad93e680a8ebbaa4eca18e7a42
UHC 略노∥柳롳쭓怨뺤졎z略노∥柳롳쭓怨뺤졎zB 111001011011001010110011111010111010000110101011111010101111011110001110111011111010011110001011111010101011001110010101111011001010000010111011011110101110010110110010101100111110101110100001101010111110101011110111100011101110111110100111100010111110101010110011100101011110110010100000101110110111101001000010 e5b2b3eba1abeaf78eefa78beab395eca0bb7ae5b2b3eba1abeaf78eefa78beab395eca0bb7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)