To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??z]]nf??z]]n^}Y??z]]nf??z]]n^}bE 001111110011111101111010010111010101110101101110011001100011111100111111011110100101110101011101011011100101111001111101010110010011111100111111011110100101110101011101011011100110011000111111001111110111101001011101010111010110111001011110011111010110001001000101 3f3f7a5d5d6e663f3f7a5d5d6e5e7d593f3f7a5d5d6e663f3f7a5d5d6e5e7d6245
SJIS-WIN 唯嗇z]]nf唯嗇z]]n^}Y唯嗇z]]nf唯嗇z]]n^}bE 1001011101000010100110101010010101111010010111010101110101101110011001101001011101000010100110101010010101111010010111010101110101101110010111100111110101011001100101110100001010011010101001010111101001011101010111010110111001100110100101110100001010011010101001010111101001011101010111010110111001011110011111010110001001000101 97429aa57a5d5d6e6697429aa57a5d5d6e5e7d5997429aa57a5d5d6e6697429aa57a5d5d6e5e7d6245
EUC-JP 唯嗇z]]nf唯嗇z]]n^}Y唯嗇z]]nf唯嗇z]]n^}bE 1100110110100011110101001010011101111010010111010101110101101110011001101100110110100011110101001010011101111010010111010101110101101110010111100111110101011001110011011010001111010100101001110111101001011101010111010110111001100110110011011010001111010100101001110111101001011101010111010110111001011110011111010110001001000101 cda3d4a77a5d5d6e66cda3d4a77a5d5d6e5e7d59cda3d4a77a5d5d6e66cda3d4a77a5d5d6e5e7d6245
UTF-8 唯嗇z]]nf唯嗇z]]n^}Y唯嗇z]]nf唯嗇z]]n^}bE 11100101100101001010111111100101100101111000011101111010010111010101110101101110011001101110010110010100101011111110010110010111100001110111101001011101010111010110111001011110011111010101100111100101100101001010111111100101100101111000011101111010010111010101110101101110011001101110010110010100101011111110010110010111100001110111101001011101010111010110111001011110011111010110001001000101 e594afe597877a5d5d6e66e594afe597877a5d5d6e5e7d59e594afe597877a5d5d6e66e594afe597877a5d5d6e5e7d6245
UHC 唯嗇z]]nf唯嗇z]]n^}Y唯嗇z]]nf唯嗇z]]n^}bE 1110101011100110110111111110000001111010010111010101110101101110011001101110101011100110110111111110000001111010010111010101110101101110010111100111110101011001111010101110011011011111111000000111101001011101010111010110111001100110111010101110011011011111111000000111101001011101010111010110111001011110011111010110001001000101 eae6dfe07a5d5d6e66eae6dfe07a5d5d6e5e7d59eae6dfe07a5d5d6e66eae6dfe07a5d5d6e5e7d6245

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
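
The conversion behind the table can be approximated with a short Python sketch. It is not the code this page runs. The mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?", which is assumed to match the behavior seen in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("唯z").items():
        print(label, bits, hex_string)
```

ASCII characters such as "z" come out as the same single byte (7a) in all five charsets, while a character such as "唯" takes two bytes in SJIS-Win, EUC-JP and UHC, three bytes in UTF-8, and collapses to a single "?" (3f) in ISO-8859-1, which has no code point for it.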