To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻??毅??怨??筌??愉?┬鎖??悠 111001001110100000111111001111111000101101000010001111110011111110001001100001010011111100111111111000101010001100111111001111111001011011111001001111111000010010100110100011011011110100111111001111111001011101001001 e4e83f3f8b423f3f89853f3fe2a33f3f96f93f84a68dbd3f3f9749
EUC-JP 蒻??毅??怨??筌??愉?┬鎖??悠 111010001110101000111111001111111011010110100011001111110011111110110001111001010011111100111111111001001010010100111111001111111100110011111011001111111010100010101000101110101011111100111111001111111100110110101010 e8ea3f3fb5a33f3fb1e53f3fe4a53f3fccfb3fa8a8babf3f3fcdaa
UTF-8 蒻몃뿭毅싨끽怨뀁첌筌믩끃愉억┬鎖좊씮悠 111010001001001010111011111010111010101010000011111010111011111110101101111001101010111110000101111011001000101110101000111010111000000110111101111001101000000010101000111010111000000010000001111011001011001010001100111001111010110110001100111010111010111110101001111010111000000110000011111001101000010010001001111011001001011010110101111000101001010010101100111010011000111010010110111011001010001010001010111011001001010010101110111001101000001010100000 e892bbebaa83ebbfade6af85ec8ba8eb81bde680a8eb8081ecb28ce7ad8cebafa9eb8183e68489ec96b5e294ace98e96eca28aec94aee682a0
UHC 蒻몃뿭毅싨끽怨뀁첌筌믩끃愉억┬鎖좊씮悠 1110010110110110101110001110101110010111101011011110101111110110100110101110011010110011101000111110101010110011101100101110110010101010100110011110111110100111100100101110101110000101101110011110101011110000101111101110111110100110101010001110000111110000101000001110101110011101101111111110101011101101 e5b6b8eb97adebf69ae6b3a3eab3b2ecaa99efa792eb85b9eaf0beefa6a8e1f0a0eb9dbfeaed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)