To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 夜??儒??癲??有??音〓?誼??擬??B 100101101110100100111111001111111000111011110010001111110011111111100001100111110011111100111111100101110100110000111111001111111000100110111001100000011010110000111111100010110110001000111111001111111000101101011011001111110011111101000010 96e93f3f8ef23f3fe19f3f3f974c3f3f89b981ac3f8b623f3f8b5b3f3f42
EUC-JP 夜??儒??癲??有??音〓?誼??擬??B 110011001110101100111111001111111011110011110100001111110011111111100010101000010011111100111111110011011010110100111111001111111011001010111011101000101010111000111111101101011100001100111111001111111011010110111100001111110011111101000010 cceb3f3fbcf43f3fe2a13f3fcdad3f3fb2bba2ae3fb5c33f3fb5bc3f3f42
UTF-8 夜껋눀儒몄젡癲싳뼦有뷴쮦音〓끂誼싷쬃擬⑸빜B 11100101101001001001110011101010101110111000101111101011100010001000000011100101100001001001001011101011101010101000010011101100101000001010000111100111100110011011001011101100100010111011001111101011101111001010011011100110100111001000100111101011101101111011010011101100101011101010011011101001100111111011001111100011100000001001001111101011100000011000001011101000101010101011110011101100100010111011011111101100101011001000001111100110100100111010110011100010100100011011100011101011101110011001110001000010 e5a49ceabb8beb8880e58492ebaa84eca0a1e799b2ec8bb3ebbca6e69c89ebb7b4ecaea6e99fb3e38093eb8182e8aabcec8bb7ecac83e693ace291b8ebb99c42
UHC 夜껋눀儒몄젡癲싳뼦有뷴쮦音〓끂誼싷쬃擬⑸빜B 11100101101010001000001111101100100001111010000111101010111000111011100011101100101000001001101011101111101001101001101011101100100101101010100111101010111100111011101011100101101010001000001111101011111001011010000111101011100001011011100011101011111111101001101011101111101001101001101011101011111101001010100111101011100101011011101001000010 e5a883ec87a1eae3b8eca09aefa69aec96a9eaf3bae5a883ebe5a1eb85b8ebfe9aefa69aebf4a9eb95ba42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)