To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 穩??節??壤ヨ?譯??閻??訝??晤?? 1110001001110010001111110011111110010000110111110011111100111111100110101101111110000011100010000011111111100110101000010011111100111111111010001000010100111111001111111110011001100010001111110011111110011101111010110011111100111111 e2723f3f90df3f3f9adf83883fe6a13f3fe8853f3fe6623f3f9deb3f3f
EUC-JP 穩??節??壤ヨ?譯??閻??訝??晤?? 1110001111010011001111110011111111000000111000010011111100111111110101001110000110100101111010000011111111101100101000110011111100111111111011111110010100111111001111111110101111000011001111110011111111011010111011010011111100111111 e3d33f3fc0e13f3fd4e1a5e83feca33f3fefe53f3febc33f3fdaed3f3f
UTF-8 穩며쥈節썰툧壤ヨ씇譯볩슐閻곻슬訝욃쉥晤잍춴 111001111010100110101001111010111010100110110000111011001010010110001000111001111010111110000000111011001000110110110000111011011000100010100111111001011010001110100100111000111000001110101000111011001001010010000111111010001010110110101111111010111011001110101001111011001000101010010000111010011001011010111011111010101011001110111011111011001000101010101100111010001010100010011101111011001001101010000011111011001000100110100101111001101001100110100100111011001001111010001101111011001011011010110100 e7a9a9eba9b0eca588e7af80ec8db0ed88a7e5a3a4e383a8ec9487e8adafebb3a9ec8a90e996bbeab3bbec8aace8a89dec9a83ec89a5e699a4ec9e8decb6b4
UHC 穩며쥈節썰툧壤ヨ씇譯볩슐閻곻슬訝욃쉥晤잍춴 111010001011000110111000111001111010001010000001111011111011110110111101111001001011100010011110111001011011110110101011111010001001110110011111111001101011101110010011111011111011110110110110111001111010001010000001111011111011110110111101111001001011100010011110111001011011110110101011111001111111101110011111111001101010110110010000 e8b1b8e7a281efbdbde4b89ee5bdabe89d9fe6bb93efbdb6e7a281efbdbde4b89ee5bdabe7fb9fe6ad90

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)