To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 厄μ????音??鴨??厄μ????音??鴨??B 100101101110111110000011110010100011111100111111001111110011111110001001101110010011111100111111100010101001101100111111001111111001011011101111100000111100101000111111001111110011111100111111100010011011100100111111001111111000101010011011001111110011111101000010 96ef83ca3f3f3f3f89b93f3f8a9b3f3f96ef83ca3f3f3f3f89b93f3f8a9b3f3f42
EUC-JP 厄μ?堉??音??鴨??厄μ?堉??音??鴨??B 11001100111100011010011011001100001111111000111110110111111111010011111100111111101100101011101100111111001111111011001111111011001111110011111111001100111100011010011011001100001111111000111110110111111111010011111100111111101100101011101100111111001111111011001111111011001111110011111101000010 ccf1a6cc3f8fb7fd3f3fb2bb3f3fb3fb3f3fccf1a6cc3f8fb7fd3f3fb2bb3f3fb3fb3f3f42
UTF-8 厄μ쥙堉득뿬音쀬꽀鴨앹츪厄μ쥙堉득뿬音쀬꽀鴨앹츪B 1110010110001110100001001100111010111100111011001010010110011001111001011010000010001001111010111001001110011101111010111011111110101100111010011001111110110011111011001000000010101100111010101011110110000000111010011011010010101000111011001001010110111001111011001011100010101010111001011000111010000100110011101011110011101100101001011001100111100101101000001000100111101011100100111001110111101011101111111010110011101001100111111011001111101100100000001010110011101010101111011000000011101001101101001010100011101100100101011011100111101100101110001010101001000010 e58e84cebceca599e5a089eb939debbface99fb3ec80aceabd80e9b4a8ec95b9ecb8aae58e84cebceca599e5a089eb939debbface99fb3ec80aceabd80e9b4a8ec95b9ecb8aa42
UHC 厄μ쥙堉득뿬音쀬꽀鴨앹츪厄μ쥙堉득뿬音쀬꽀鴨앹츪B 11100100111110001010010111101100101000101000111011101011101111001011010111100110100101111010110011101011111001011001011111101100100001001001010111100100111001011001110111101100101011101001111111100100111110001010010111101100101000101000111011101011101111001011010111100110100101111010110011101011111001011001011111101100100001001001010111100100111001011001110111101100101011101001111101000010 e4f8a5eca28eebbcb5e697acebe597ec8495e4e59decae9fe4f8a5eca28eebbcb5e697acebe597ec8495e4e59decae9f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)