To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 闌ッ蝎ィ雜ウ闖エ蛻サ闥占険蝎ィ邨り険蝎ィ雜ウ 111010001000110010101111111001011001100110101000111010001011011010110011111010001000111110110100111001011000100010111011111010001001001010010000111010001000110010101111111001011001100110101000111001111011010110000010111010001000110010101111111001011001100110101000111010001011011010110011 e88cafe599a8e8b6b3e88fb4e588bbe89290e88cafe599a8e7b582e88cafe599a8e8b6b3
EUC-JP 闌ッ蝎ィ雜ウ闖エ蛻サ闥占険蝎ィ邨り険蝎ィ雜ウ 1110111111101100100011101010111111101001111110011000111010101000111100001011100010001110101100111110111111101111100011101011010011101001111010001000111010111011111011111111001011000000111010101011100010110001111010011111100110001110101010001110111010110111101001001110101010111000101100011110100111111001100011101010100011110000101110001000111010110011 efec8eafe9f98ea8f0b88eb3efef8eb4e9e88ebbeff2c0eab8b1e9f98ea8eeb7a4eab8b1e9f98ea8f0b88eb3
UTF-8 闌ッ蝎ィ雜ウ闖エ蛻サ闥占険蝎ィ邨り険蝎ィ雜ウ 111010011001011110001100111011111011110110101111111010001001110110001110111011111011110110101000111010011001101110011100111011111011110110110011111010011001011110010110111011111011110110110100111010001001101110111011111011111011110110111011111010011001011110100101111001011000110110100000111010011001100110111010111010001001110110001110111011111011110110101000111010011000001010101000111000111000001010001010111010011001100110111010111010001001110110001110111011111011110110101000111010011001101110011100111011111011110110110011 e9978cefbdafe89d8eefbda8e99b9cefbdb3e99796efbdb4e89bbbefbdbbe997a5e58da0e999bae89d8eefbda8e982a8e3828ae999bae89d8eefbda8e99b9cefbdb3
UHC ??蝎?雜?闖????占?蝎?邨り?蝎?雜? 00111111001111111100101011101001001111111110110111011010001111111111011111100110001111110011111100111111001111111110111110111111001111111100101011101001001111111111010110111110101010101110101000111111110010101110100100111111111011011101101000111111 3f3fcae93fedda3ff7e63f3f3f3fefbf3fcae93ff5beaaea3fcae93fedda3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)