To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壯↑?僥??僥??慂??絶?ぜ節??僥?? 100110101110000110000001101010100011111110011001010001100011111100111111100110010100011000111111001111111001110011001000001111110011111110010000111000100011111110000010101110101001000011011111001111110011111110011001010001100011111100111111 9ae181aa3f99463f3f99463f3f9cc83f3f90e23f82ba90df3f3f99463f3f
EUC-JP 壯↑?僥??僥??慂??絶?ぜ節??僥?? 110101001110001110100010101011000011111111010001101001110011111100111111110100011010011100111111001111111101100011001010001111110011111111000000111001000011111110100100101111001100000011100001001111110011111111010001101001110011111100111111 d4e3a2ac3fd1a73f3fd1a73f3fd8ca3f3fc0e43fa4bcc0e13f3fd1a73f3f
UTF-8 壯↑뮅僥쀧뿏僥숋풊慂딉슈絶녽ぜ節꿰뿏僥숋풊 111001011010001110101111111000101000011010010001111010111010111010000101111001011000001110100101111011001000000010100111111010111011111110001111111001011000001110100101111011001000100010001011111011011001001010001010111001101000010110000010111010111001010010001001111011001000101010001000111001111011010110110110111010111000010110111101111000111000000110011100111001111010111110000000111010101011111110110000111010111011111110001111111001011000001110100101111011001000100010001011111011011001001010001010 e5a3afe28691ebae85e583a5ec80a7ebbf8fe583a5ec888bed928ae68582eb9489ec8a88e7b5b6eb85bde3819ce7af80eabfb0ebbf8fe583a5ec888bed928a
UHC 壯↑뮅僥쀧뿏僥숋풊慂딉슈絶녽ぜ節꿰뿏僥숋풊 111011011110000010100001111010001001001010010100111010001110100110010111111001111001011110010100111010001110100110011001111011111011111010010000111010011011110110001010111011111011110110110100111011111011111010000110111010011010101010111100111011111011110110110010111001111001011110010100111010001110100110011001111011111011111010010000 ede0a1e89294e8e997e79794e8e999efbe90e9bd8aefbdb4efbe86e9aabcefbdb2e79794e8e999efbe90

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)