To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 闕オ蛟大」暮頼諤。闕オ蛟大」暮頼謔ウ^ 11101000100011011011010111100101100000001001000111100101101000111001010111101001100101111000101011100110100000001010000111101000100011011011010111100101100000001001000111100101101000111001010111101001100101111000101011100110100000101011001101011110 e88db5e58091e5a395e9978ae680a1e88db5e58091e5a395e9978ae682b35e
EUC-JP 闕オ蛟大」暮頼諤。闕オ蛟大」暮頼謔ウ^ 11101111111011011000111010110101111010011110000011000010111001111000111010100011110010101110101111001101111010101110101111100000100011101010000111101111111011011000111010110101111010011110000011000010111001111000111010100011110010101110101111001101111010101110101111100010100011101011001101011110 efed8eb5e9e0c2e78ea3caebcdeaebe08ea1efed8eb5e9e0c2e78ea3caebcdeaebe28eb35e
UTF-8 闕オ蛟大」暮頼諤。闕オ蛟大」暮頼謔ウ^ 11101001100101111001010111101111101111011011010111101000100110111001111111100101101001001010011111101111101111011010001111100110100110101010111011101001101000001011110011101000101010111010010011101111101111011010000111101001100101111001010111101111101111011011010111101000100110111001111111100101101001001010011111101111101111011010001111100110100110101010111011101001101000001011110011101000101011001001010011101111101111011011001101011110 e99795efbdb5e89b9fe5a4a7efbda3e69aaee9a0bce8aba4efbda1e99795efbdb5e89b9fe5a4a7efbda3e69aaee9a0bce8ac94efbdb35e
UHC 闕?蛟大?暮???闕?蛟大?暮?謔?^ 11001111111101000011111111001110111100011101001111011110001111111101100110111010001111110011111100111111110011111111010000111111110011101111000111010011110111100011111111011001101110100011111111111001110011000011111101011110 cff43fcef1d3de3fd9ba3f3f3fcff43fcef1d3de3fd9ba3ff9cc3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)