What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 岳???????????蚓??濡リ?癲??葵B 1000101001111000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111100101011011010011111100111111100101000100011110000011100010100011111111100001100111110011111100111111100010001010100001000010 8a783f3f3f3f3f3f3f3f3f3f3fe56d3f3f9447838a3fe19f3f3f88a842
EUC-JP 岳??庾??洹?????蚓??濡リ?癲??葵B 101100111101100100111111001111111000111110111100110011100011111100111111100011111100011110111010001111110011111100111111001111110011111111101001110011100011111100111111110001111010100010100101111010100011111111100010101000010011111100111111101100001010101001000010 b3d93f3f8fbcce3f3f8fc7ba3f3f3f3f3fe9ce3f3fc7a8a5ea3fe2a13f3fb0aa42
UTF-8 岳묒빖庾삯윀洹욌꽧凉쏆슖蚓곩뼦濡リ콟癲섍퉮葵B 11100101101100101011001111101011101011001001001011101011101110011001011011100101101110101011111011101100100000101010111111101100100111001000000011100110101101001011100111101100100110101000110011101010101111011010011111101111101001011011100111101100100011111000011011101100100010101001011011101000100110101001001111101010101100111010100111101011101111001010011011100110101111111010000111100011100000111010101011101100101111011001111111100111100110011011001011101100100001001000110111101101100010011010111011101000100100011011010101000010 e5b2b3ebac92ebb996e5babeec82afec9c80e6b4b9ec9a8ceabda7efa5b9ec8f86ec8a96e89a93eab3a9ebbca6e6bfa1e383aaecbd9fe799b2ec848ded89aee891b542
UHC 岳묒빖庾삯윀洹욌꽧凉쏆슖蚓곩뼦濡リ콟癲섍퉮葵B 111001001011111110010001111011001001010110111000111010101110110010111011111010011001111110001011111010101011011110011110111010111000010010110010111001011011110010011011111011001001101010100101111011001110001010000001111001011001011010101001111010111010000110101011111010101011000110010111111011111010011010011000111010101011100110000110110100001010110101000010 e4bf91ec95b8eaecbbe99f8beab79eeb84b2e5bc9bec9aa5ece281e596a9eba1abeab197efa698eab986d0ad42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
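
The conversion shown in the table above can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the names `to_bit_strings`, `CHARSETS`, and `convert` are made up for illustration, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Substituting `?` (0x3f) for characters a charset cannot represent is also an assumption, inferred from the `3f` bytes in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # errors="replace" emits "?" (0x3f) for characters the charset cannot encode.
    raw = text.encode(charset, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("B").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("B", "utf-8")` returns `("01000010", "42")`, which matches the final byte of every row in the table.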