To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 淫??音?い松??n}淫??音?い松??n{^ 10001000111110100011111100111111100010011011100100111111100000101010001010001111101111000011111100111111011011100111110110001000111110100011111100111111100010011011100100111111100000101010001010001111101111000011111100111111011011100111101101011110 88fa3f3f89b93f82a28fbc3f3f6e7d88fa3f3f89b93f82a28fbc3f3f6e7b5e
EUC-JP 淫??音?い松??n}淫??音?い松??n{^ 10110000111111000011111100111111101100101011101100111111101001001010010010111110101111100011111100111111011011100111110110110000111111000011111100111111101100101011101100111111101001001010010010111110101111100011111100111111011011100111101101011110 b0fc3f3fb2bb3fa4a4bebe3f3f6e7db0fc3f3fb2bb3fa4a4bebe3f3f6e7b5e
UTF-8 淫덉눦音긷い松쎌뭇n}淫덉눦音긷い松쎌뭇n{^ 1110011010110111101010111110101110001101100010011110101110001000101001101110100110011111101100111110101010111000101101111110001110000001100001001110011010011101101111101110110010001110100011001110101110101101100001110110111001111101111001101011011110101011111010111000110110001001111010111000100010100110111010011001111110110011111010101011100010110111111000111000000110000100111001101001110110111110111011001000111010001100111010111010110110000111011011100111101101011110 e6b7abeb8d89eb88a6e99fb3eab8b7e38184e69dbeec8e8cebad876e7de6b7abeb8d89eb88a6e99fb3eab8b7e38184e69dbeec8e8cebad876e7b5e
UHC 淫덉눦音긷い松쎌뭇n}淫덉눦音긷い松쎌뭇n{^ 1110101111100010100010001110110010000111101111011110101111100101101100011110010110101010101001001110000111100110101111011110110010111001101101010110111001111101111010111110001010001000111011001000011110111101111010111110010110110001111001011010101010100100111000011110011010111101111011001011100110110101011011100111101101011110 ebe288ec87bdebe5b1e5aaa4e1e6bdecb9b56e7debe288ec87bdebe5b1e5aaa4e1e6bdecb9b56e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)