What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????宥??歪?9悠ゆ????? 001111110011111100111111001111110011111100111111100101110100011100111111001111111001100001100011001111111000001001011000100101110100100110000010111001000011111100111111001111110011111100111111 3f3f3f3f3f3f97473f3f98633f8258974982e43f3f3f3f3f
EUC-JP ???佾??宥??歪?9悠ゆ?洧??孼 001111110011111100111111100011111011000011111011001111110011111111001101101010000011111100111111110011111100010000111111101000111011100111001101101010101010010011100110001111111000111111000111101101000011111100111111100011111011101011000011 3f3f3f8fb0fb3f3fcda83f3fcfc43fa3b9cdaaa4e63f8fc7b43f3f8fbac3
UTF-8 麗몃쓷佾롨땻宥븍뼅歪묐9悠ゆ갭洧뷀뜙孼 111011111010011010001000111010111010101010000011111011001001001110110111111001001011110110111110111010111010000110101000111010111001010110111011111001011010111010100101111010111011100010001101111010111011110010000101111001101010110110101010111010111010110010010000111011111011110010011001111001101000001010100000111000111000001010000110111010101011000010101101111001101011010010100111111010111011011110000000111010111001110010011001111001011010110110111100 efa688ebaa83ec93b7e4bdbeeba1a8eb95bbe5aea5ebb88debbc85e6adaaebac90efbc99e682a0e38286eab0ade6b4a7ebb780eb9c99e5adbc
UHC 麗몃쓷佾롨땻宥븍뼅歪묐9悠ゆ갭洧뷀뜙孼 1110011010110000101110001110101110011101100101001110110011101011100011101110100010001011100100011110101011101001101110101110101110010110100011111110100011100000100100011110101110100011101110011110101011101101101010101110011010110000101110001110101011111011100101001110110110001101100111001110010111101101 e6b0b8eb9d94eceb8ee88b91eae9baeb968fe8e091eba3b9eaedaae6b0b8eafb94ed8d9ce5ed

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (3f) in the table.
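
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `encode_table` is a hypothetical helper name. Unencodable characters are replaced with "?" (3f), which matches what the table shows, but bytes for rare characters may still differ between codec implementations.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("ゆ"):
        print(label, bits, hex_string)
```

For the single character "ゆ", this yields the same byte sequences that appear inside the SJIS-WIN, EUC-JP, and UTF-8 rows above (82e4, a4e6, and e38286 respectively).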