To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????椅?????譽??吟??徇??B 0011111100111111001111110011111100111111001111111000100011010110001111110011111100111111001111110011111111100110101000110011111100111111100010111110000100111111001111111001110001101101001111110011111101000010 3f3f3f3f3f3f88d63f3f3f3f3fe6a33f3f8be13f3f9c6d3f3f42
EUC-JP ???彛??椅?????譽??吟??徇??B 00111111001111110011111110001111101111001111101000111111001111111011000011011000001111110011111100111111001111110011111111101100101001010011111100111111101101101110001100111111001111111101011111001110001111110011111101000010 3f3f3f8fbcfa3f3fb0d83f3f3f3f3feca53f3fb6e33f3fd7ce3f3f42
UTF-8 琉뗨꽮彛쀬깦椅됥걩溜싲쨱譽뱀춻吟섏뿁徇믩젡B 11101111101001111000110011101011100101111010100011101010101111011010111011100101101111011001101111101100100000001010110011101010101110011010011011100110101001001000010111101011100100001010010111101010101100011010100111101111101001111000101111101100100010111011001011101100101010001011000111101000101011011011110111101011101100011000000011101100101101101011101111100101100100001001111111101100100001001000111111101011101111111000000111100101101111101000011111101011101011111010100111101100101000001010000101000010 efa78ceb97a8eabdaee5bd9bec80aceab9a6e6a485eb90a5eab1a9efa78bec8bb2eca8b1e8adbdebb180ecb6bbe5909fec848febbf81e5be87ebafa9eca0a142
UHC 琉뗨꽮彛쀬깦椅됥걩溜싲쨱譽뱀춻吟섏뿁徇믩젡B 11101011101001001000101111101000100001001011100111101100101011011001011111101100100000111001100011101011111101011000100111100011100000011001001011101010111111101001101011101011101001001000101111100111111000101011100111101100101011011001011111101011111000011001100011101100100101111000100111100010110111111001001011101011101000001001101001000010 eba48be884b9ecad97ec8398ebf589e38192eafe9aeba48be7e2b9ecad97ebe198ec9789e2df92eba09a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)