To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 移?鬱??蝎e蹟?爾魄移?鬱??蝎e蹟?爾白^ 10001000110110100011111110011111010101000011111100111111111001011001100110000010100001011001000011010110001111111000111010100010111010011010111010001000110110100011111110011111010101000011111100111111111001011001100110000010100001011001000011010110001111111000111010100010100101001001001001011110 88da3f9f543f3fe599828590d63f8ea2e9ae88da3f9f543f3fe599828590d63f8ea294925e
EUC-JP 移?鬱??蝎e蹟?爾魄移?鬱??蝎e蹟?爾白^ 10110000110111000011111111011101101101010011111100111111111010011111100110100011111001011100000011011000001111111011110010100100111100101011000010110000110111000011111111011101101101010011111100111111111010011111100110100011111001011100000011011000001111111011110010100100110001111111001001011110 b0dc3fddb53f3fe9f9a3e5c0d83fbca4f2b0b0dc3fddb53f3fe9f9a3e5c0d83fbca4c7f25e
UTF-8 移렯鬱렚溜蝎e蹟렏爾魄移렯鬱렚溜蝎e蹟렏爾白^ 11100111101001111011101111101011101000001010111111101001101011001011000111101011101000001001101011101111101001111000101111101000100111011000111011101111101111011000010111101000101110011001111111101011101000001000111111100111100010001011111011101001101011011000010011100111101001111011101111101011101000001010111111101001101011001011000111101011101000001001101011101111101001111000101111101000100111011000111011101111101111011000010111101000101110011001111111101011101000001000111111100111100010001011111011100111100110011011110101011110 e7a7bbeba0afe9acb1eba09aefa78be89d8eefbd85e8b99feba08fe788bee9ad84e7a7bbeba0afe9acb1eba09aefa78be89d8eefbd85e8b99feba08fe788bee799bd5e
UHC 移렯鬱렚溜蝎e蹟렏爾魄移렯鬱렚溜蝎e蹟렏爾白^ 111011001011100110001110101111001110101010100110100011101010110111101010111111101100101011101001101000111110010111101110111001111000111010100101111011001011001111011011110111101110110010111001100011101011110011101010101001101000111010101101111010101111111011001010111010011010001111100101111011101110011110001110101001011110110010110011110110111101110001011110 ecb98ebceaa68eadeafecae9a3e5eee78ea5ecb3dbdeecb98ebceaa68eadeafecae9a3e5eee78ea5ecb3dbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)