To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄??泣??恂?????音??純??鈺 10010110111011110011111100111111100010111000001100111111001111111001110010010110001111110011111100111111001111110011111110001001101110010011111100111111100011111000001100111111001111111111101111000100 96ef3f3f8b833f3f9c963f3f3f3f3f89b93f3f8f833f3ffbc4
EUC-JP 厄??泣??恂?????音??純??鈺 1100110011110001001111110011111110110101111000110011111100111111110101111111011000111111001111110011111100111111001111111011001010111011001111110011111110111101111000110011111100111111100011111110001111010101 ccf13f3fb5e33f3fd7f63f3f3f3f3fb2bb3f3fbde33f3f8fe3d5
UTF-8 厄댁뼚泣쒑굢恂ⓦ걶若뗫쵎音귛윜純껊폏鈺 111001011000111010000100111010111000110010000001111010111011110010011010111001101011001110100011111011001001001010010001111010101011010110100010111001101000000110000010111000101001001110100110111010101011000110110110111011111010010110110100111010111001011110101011111011001011010110001110111010011001111110110011111010101011011110011011111011001001110010011100111001111011010010010100111010101011101110001010111011011000111110001111111010011000100010111010 e58e84eb8c81ebbc9ae6b3a3ec9291eab5a2e68182e293a6eab1b6efa5b4eb97abecb58ee99fb3eab79bec9c9ce7b494eabb8aed8f8fe988ba
UHC 厄댁뼚泣쒑굢恂ⓦ걶若뗫쵎音귛윜純껊폏鈺 1110010011111000101101001110110010010110101000001110101111101000100111001110100010000010100010011110001011100001101010001110001110000001100111001110010110101110100010111110101110101100100100001110101111100101100000101110010110011111100111111110001011101101100000111110101110111100100110101110100010101101 e4f8b4ec96a0ebe89ce88289e2e1a8e3819ce5ae8bebac90ebe582e59f9fe2ed83ebbc9ae8ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)