To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 褶ィ螻・褶ィ蜷æ 11101000101001001011011011101111101111011010100011101000100111101011101111101111101111011010010111101000101001001011011011101111101111011010100011101000100111001011011111100110 e8a4b6efbda8e89ebbefbda5e8a4b6efbda8e89cb7e6
SJIS-WIN ??¶??¨?????¥??¶??¨???? 001111110011111110000001111101110011111100111111100000010100111000111111001111110011111100111111001111111000000110001111001111110011111110000001111101110011111100111111100000010100111000111111001111110011111100111111 3f3f81f73f3f814e3f3f3f3f3f818f3f3f81f73f3f814e3f3f3f3f
EUC-JP 褶ï?¨è??ï??褶ï?¨è??æ 10001111101010111011001010001111101000101111000010100010111110011000111110101011110000010011111110100001101011111000111110101011101100100011111100111111100011111010101111000001001111110011111110001111101010111011001010001111101000101111000010100010111110011000111110101011110000010011111110100001101011111000111110101011101100100011111100111111100011111010100111000001 8fabb28fa2f0a2f98fabc13fa1af8fabb23f3f8fabc13f3f8fabb28fa2f0a2f98fabc13fa1af8fabb23f3f8fa9c1
UTF-8 褶ィ螻・褶ィ蜷æ 1100001110101000110000101010010011000010101101101100001110101111110000101011110111000010101010001100001110101000110000101001111011000010101110111100001110101111110000101011110111000010101001011100001110101000110000101010010011000010101101101100001110101111110000101011110111000010101010001100001110101000110000101001110011000010101101111100001110100110 c3a8c2a4c2b6c3afc2bdc2a8c3a8c29ec2bbc3afc2bdc2a5c3a8c2a4c2b6c3afc2bdc2a8c3a8c29cc2b7c3a6
UHC ?¤¶?½¨????½??¤¶?½¨??·æ 001111111010001010110100101000101101001000111111101010001111011010100001101001110011111100111111001111110011111110101000111101100011111100111111101000101011010010100010110100100011111110101000111101101010000110100111001111110011111110100001101001001010100110100001 3fa2b4a2d23fa8f6a1a73f3f3f3fa8f63f3fa2b4a2d23fa8f6a1a73f3fa1a4a9a1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)