To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弱??弱????【節ヨ?弱??抑?ァ? 100011101110001100111111001111111000111011100011001111110011111100111111001111111000000101111001100100001101111110000011100010000011111110001110111000110011111100111111100101110111110100111111100000110100000000111111 8ee33f3f8ee33f3f3f3f817990df83883f8ee33f3f977d3f83403f
EUC-JP 弱??弱????【節ヨ?弱??抑?ァ? 101111001110010100111111001111111011110011100101001111110011111100111111001111111010000111011010110000001110000110100101111010000011111110111100111001010011111100111111110011011101111000111111101001011010000100111111 bce53f3fbce53f3f3f3fa1dac0e1a5e83fbce53f3fcdde3fa5a13f
UTF-8 弱놅쉿弱놅풕若든【節ヨ뀸弱놅풓抑섊ァ溫 111001011011110010110001111010111000011010000101111011001000100110111111111001011011110010110001111010111000011010000101111011011001001010010101111011111010010110110100111010111001001110100000111000111000000010010000111001111010111110000000111000111000001110101000111010111000000010111000111001011011110010110001111010111000011010000101111011011001001010010011111001101000101010010001111011001000010010001010111000111000001010100001111001101011101010101011 e5bcb1eb8685ec89bfe5bcb1eb8685ed9295efa5b4eb93a0e38090e7af80e383a8eb80b8e5bcb1eb8685ed9293e68a91ec848ae382a1e6baab
UHC 弱놅쉿弱놅풕若든【節ヨ뀸弱놅풓抑섊ァ溫 1110010110110000100001101110111110111101101100101110010110110000100001101110111110111110100110001110010110101110101101011110011110100001101111001110111110111101101010111110100010000101101011101110010110110000100001101110111110111110100101111110010111100100100110001110011110101011101000011110100010101110 e5b086efbdb2e5b086efbe98e5aeb5e7a1bcefbdabe885aee5b086efbe97e5e498e7aba1e8ae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)