Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 曜??節??予??箋??節?????絶?『^ 1001011101101010001111110011111110010000110111110011111100111111100101110101110000111111001111111110001010110011001111110011111110010000110111110011111100111111001111110011111100111111100100001110001000111111100000010111011101011110 976a3f3f90df3f3f975c3f3fe2b33f3f90df3f3f3f3f3f90e23f81775e
EUC-JP 曜??節??予??箋??節?????絶?『^ 1100110111001011001111110011111111000000111000010011111100111111110011011011110100111111001111111110010010110101001111110011111111000000111000010011111100111111001111110011111100111111110000001110010000111111101000011101100001011110 cdcb3f3fc0e13f3fcdbd3f3fe4b53f3fc0e13f3f3f3f3fc0e43fa1d85e
UTF-8 曜쀯슭節ㅶ씭予숋풓箋섌슅節억숴呂얏슇絶쀧『^ 11100110100110111001110011101100100000001010111111101100100010101010110111100111101011111000000011100011100001011011011011101100100101001010110111100100101110101000100011101100100010001000101111101101100100101001001111100111101011101000101111101100100001001000110011101100100010101000010111100111101011111000000011101100100101101011010111101100100010001011010011101111101001101000000011101100100101101000111111101100100010101000011111100111101101011011011011101100100000001010011111100011100000001000111001011110 e69b9cec80afec8aade7af80e385b6ec94ade4ba88ec888bed9293e7ae8bec848cec8a85e7af80ec96b5ec88b4efa680ec968fec8a87e7b5b6ec80a7e3808e5e
UHC 曜쀯슭節ㅶ씭予숋풓箋섌슅節억숴呂얏슇絶쀧『^ 11101000111110001001011111101111101111011011111011101111101111011010010011100110100111011011111011100101111110001001100111101111101111101001011111101111101010001001100011101001100110101001011111101111101111011011111011101111101111011010010011100101111110111011111011100110100110101001100111101111101111101001011111100111101000011011101001011110 e8f897efbdbeefbda4e69dbee5f899efbe97efa898e99a97efbdbeefbda4e5fbbee69a99efbe97e7a1ba5e
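
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper names CHARSETS, to_bitstrings and convert are assumptions made for illustration. Unencodable characters are replaced with "?" (hex 3f), which appears to match what the table shows for ISO-8859-1, but the page's actual fallback behaviour is not documented here.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bitstrings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot
    # represent (an assumption about how the page handles them).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bitstrings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("曜").items():
        print(label, binary, hexadecimal)
```

For the single character 曜, this sketch yields 976a for cp932, cdcb for euc_jp and e69b9c for utf-8, which agrees with the leading bytes of the corresponding rows in the table.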

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).