To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 賊?霽?悠??豆?鎖?????????? 10010001101011110011111111101000110001110011111110010111010010010011111100111111100100111010010000111111100011011011110100111111001111110011111100111111001111110011111100111111001111110011111100111111 91af3fe8c73f97493f3f93a43f8dbd3f3f3f3f3f3f3f3f3f3f
EUC-JP 賊?霽?悠??豆?鎖?雩?????雩?? 1100001010110001001111111111000011001001001111111100110110101010001111110011111111000110101001100011111110111010101111110011111110001111111001101111101000111111001111110011111100111111001111111000111111100110111110100011111100111111 c2b13ff0c93fcdaa3f3fc6a63fbabf3f8fe6fa3f3f3f3f3f8fe6fa3f3f
UTF-8 賊렠霽렢悠꿱렍豆뤈鎖떠雩컣룽툗산┲雩첁계 111010001011001110001010111010111010000010100000111010011001110010111101111010111010000010100010111001101000001010100000111010101011111110110001111010111010000010001101111010001011000110000110111010111010010010001000111010011000111010010110111010111001011010100000111010011001101110101001111011001011101110100011111010111010001110111101111011011000100010010111111011001000001010110000111000101001010010110010111010011001101110101001111011001011001010000001111010101011001110000100 e8b38aeba0a0e99cbdeba0a2e682a0eabfb1eba08de8b186eba488e98e96eb96a0e99ba9ecbba3eba3bded8897ec82b0e294b2e99ba9ecb281eab384
UHC 賊렠霽렢悠꿱렍豆뤈鎖떠雩컣룽툗산┲雩첁계 11101110111001001000111010110001111100001011100010001110101100111110101011101101101100101110100010001110101000111101010011100111100011111011100011100001111100001011011010110000111010011110110010110000100011101011011111101110101110001000111010111011111010101010011011010100111010011110110010101010100011101011000011101000 eee48eb1f0b88eb3eaedb2e88ea3d4e78fb8e1f0b6b0e9ecb08eb7eeb88ebbeaa6d4e9ecaa8eb0e8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)