To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?レ?違??幽???レ?違??擬???レ?乳 001111111000001110001100001111111000100011100001001111110011111110010111010010000011111100111111001111111000001110001100001111111000100011100001001111110011111110001011010110110011111100111111001111111000001110001100001111111001001111111011 3f838c3f88e13f3f97483f3f3f838c3f88e13f3f8b5b3f3f3f838c3f93fb
EUC-JP ?レ?違??幽???レ?違??擬???レ?乳 001111111010010111101100001111111011000011100011001111110011111111001101101010010011111100111111001111111010010111101100001111111011000011100011001111110011111110110101101111000011111100111111001111111010010111101100001111111100011011111101 3fa5ec3fb0e33f3fcda93f3f3fa5ec3fb0e33f3fb5bc3f3f3fa5ec3fc6fd
UTF-8 曆レ눘違볢뵯幽꾨섶曆レ쥙違띶렚擬좉퐫曆レ눘乳 111011111010011010001011111000111000001110101100111010111000100010011000111010011000000110010101111010111011001110100010111010111011010110101111111001011011100110111101111010101011111010101000111011001000010010110110111011111010011010001011111000111000001110101100111011001010010110011001111010011000000110010101111010111001110110110110111010111010000010011010111001101001001110101100111011001010001010001001111011011001000010101011111011111010011010001011111000111000001110101100111010111000100010011000111001001011100110110011 efa68be383aceb8898e98195ebb3a2ebb5afe5b9bdeabea8ec84b6efa68be383aceca599e98195eb9db6eba09ae693aceca289ed90abefa68be383aceb8898e4b9b3
UHC 曆レ눘違볢뵯幽꾨섶曆レ쥙違띶렚擬좉퐫曆レ눘乳 1110011010110111101010111110110010000111101100011110101011011110100100111110100010010100101011011110101011101011100001001110101110111100101110111110011010110111101010111110110010100010100011101110101011011110100011011110010110001110101011011110101111110100101000001110101010111101100101001110011010110111101010111110110010000111101100011110101011100001 e6b7abec87b1eade93e894adeaeb84ebbcbbe6b7abeca28eeade8de58eadebf4a0eabd94e6b7abec87b1eae1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)