What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厓る?壓?????悅??躍???щぉ???^ 1111101010001101100000101110100100111111100110101101100000111111001111110011111100111111001111111111101010111101001111110011111110010110111101000011111100111111001111111000010010001011100000101010011100111111001111110011111101011110 fa8d82e93f9ad83f3f3f3f3ffabd3f3f96f43f3f3f848b82a73f3f3f5e
EUC-JP 厓る?壓????????躍???щぉ???^ 1000111110110100110001111010010011101011001111111101010011011010001111110011111100111111001111110011111100111111001111110011111111001100111101100011111100111111001111111010011111101011101001001010100100111111001111110011111101011110 8fb4c7a4eb3fd4da3f3f3f3f3f3f3f3fccf63f3f3fa7eba4a93f3f3f5e
UTF-8 厓る젗壓꾧낏溜곕젣悅쎈젾躍노젾寧щぉ溜곕젛^ 111001011000111010010011111000111000001010001011111011001010000010010111111001011010001110010011111010101011111010100111111010111000001010001111111011111010011110001011111010101011001110010101111011001010000010100011111001101000001010000101111011001000111010001000111011001010000010111110111010001011101010001101111010111000010110111000111011001010000010111110111011111010011010101010110100011000100111100011100000011000100111101111101001111000101111101010101100111001010111101100101000001001101101011110 e58e93e3828beca097e5a393eabea7eb828fefa78beab395eca0a3e68285ec8e88eca0bee8ba8deb85b8eca0beefa6aad189e38189efa78beab395eca09b5e
UHC 厓る젗壓꾧낏溜곕젣悅쎈젾躍노젾寧щぉ溜곕젛^ 11100100111011011010101011101011101000001001001111100100111000101000010011101010101100111010100011101010111111101011000011101011101000001001110011100110111011011011110111101011101000001011000011100101101110001011001111101011101000001011000011100111101011001010110011101011101010101010100111101010111111101011000011101011101000001001011101011110 e4edaaeba093e4e284eab3a8eafeb0eba09ce6edbdeba0b0e5b8b3eba0b0e7acacebaaa9eafeb0eba0975e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
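
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper name `encode_table` are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code = CP949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Characters that a charset cannot represent are replaced with "?"
    (byte 0x3f) instead of raising an error.
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexa in encode_table("る^"):
        print(label, binary, hexa)
```

For the input "る^", the hexadecimal column should agree with the corresponding bytes in the table: 82e9 in SJIS-WIN, a4eb in EUC-JP, e3828b in UTF-8, and aaeb in UHC, each followed by 5e for "^". ISO-8859-1 cannot represent "る", so it yields 3f5e.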