To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 閼ア蟄倩∮隱ー譚台ソ抑閼ア蟄倩∮隱ー譚台ソ養^ 111010001000010010110001111001011010110110011000111010001000011110010011111010001010101010110000111001101001110110010001111001001011111110010111011111011110100010000100101100011110010110101101100110001110100010000111100100111110100010101010101100001110011010011101100100011110010010111111100101110111101101011110 e884b1e5ad98e88793e8aab0e69d91e4bf977de884b1e5ad98e88793e8aab0e69d91e4bf977b5e
EUC-JP 閼ア蟄倩?隱ー譚台ソ抑閼ア蟄倩?隱ー譚台ソ養^ 11101111111001001000111010110001111010101010111111010000111010100011111111110000101011001000111010110000111010111111110111000010111001101000111010111111110011011101111011101111111001001000111010110001111010101010111111010000111010100011111111110000101011001000111010110000111010111111110111000010111001101000111010111111110011011101110001011110 efe48eb1eaafd0ea3ff0ac8eb0ebfdc2e68ebfcddeefe48eb1eaafd0ea3ff0ac8eb0ebfdc2e68ebfcddc5e
UTF-8 閼ア蟄倩∮隱ー譚台ソ抑閼ア蟄倩∮隱ー譚台ソ養^ 11101001100101101011110011101111101111011011000111101000100111111000010011100101100000001010100111100010100010001010111011101001100110101011000111101111101111011011000011101000101011011001101011100101100011111011000011101111101111011011111111100110100010101001000111101001100101101011110011101111101111011011000111101000100111111000010011100101100000001010100111100010100010001010111011101001100110101011000111101111101111011011000011101000101011011001101011100101100011111011000011101111101111011011111111101001101001001000101001011110 e996bcefbdb1e89f84e580a9e288aee99ab1efbdb0e8ad9ae58fb0efbdbfe68a91e996bcefbdb1e89f84e580a9e288aee99ab1efbdb0e8ad9ae58fb0efbdbfe9a48a5e
UHC 閼?蟄?∮隱?譚台?抑閼?蟄?∮隱?譚台?養^ 11100100110110010011111111110110110111100011111110100010101100011110101111011111001111111101001111001001111101111011101100111111111001011110010011100100110110010011111111110110110111100011111110100010101100011110101111011111001111111101001111001001111101111011101100111111111001011101011101011110 e4d93ff6de3fa2b1ebdf3fd3c9f7bb3fe5e4e4d93ff6de3fa2b1ebdf3fd3c9f7bb3fe5d75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)