To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????厓る????魚?お???永?? 0011111100111111001111110011111100111111001111111111101010001101100000101110100100111111001111110011111100111111100010111001101100111111100000101010100000111111001111110011111110001001011010010011111100111111 3f3f3f3f3f3ffa8d82e93f3f3f3f8b9b3f82a83f3f3f89693f3f
EUC-JP 獒?????厓る?獒??魚?お???永?? 10001111110010111011101100111111001111110011111100111111001111111000111110110100110001111010010011101011001111111000111111001011101110110011111100111111101101011111101100111111101001001010101000111111001111110011111110110001110010100011111100111111 8fcbbb3f3f3f3f3f8fb4c7a4eb3f8fcbbb3f3fb5fb3fa4aa3f3f3fb1ca3f3f
UTF-8 獒앯텚溜욌젛厓る젒獒앲꽋魚꿰お溜김짔永낇듅 111001111000110110010010111011001001010110101111111011011000010110011010111011111010011110001011111011001001101010001100111011001010000010011011111001011000111010010011111000111000001010001011111011001010000010010010111001111000110110010010111011001001010110110010111010101011110110001011111010011010110110011010111010101011111110110000111000111000000110001010111011111010011110001011111010101011100110000000111011001010011110010100111001101011000010111000111010111000001010000111111010111001001110000101 e78d92ec95afed859aefa78bec9a8ceca09be58e93e3828beca092e78d92ec95b2eabd8be9ad9aeabfb0e3818aefa78beab980eca794e6b0b8eb8287eb9385
UHC 獒앯텚溜욌젛厓る젒獒앲꽋魚꿰お溜김짔永낇듅 111010001010001110011101111001111011011010010011111010101111111010011110111010111010000010010111111001001110110110101010111010111010000010010001111010001010001110011101111010001000010010011011111001011110000010110010111001111010101010101010111010101111111010110001111010001010001110011101111001111011010110000101111011011000101010111001 e8a39de7b693eafe9eeba097e4edaaeba091e8a39de8849be5e0b2e7aaaaeafeb1e8a39de7b585ed8ab9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)