To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 鼇??泣??楡り?z鼇??泣??楡り?zB 1110101010000111001111110011111110001011100000110011111100111111100111101011111010000010111010000011111101111010111010101000011100111111001111111000101110000011001111110011111110011110101111101000001011101000001111110111101001000010 ea873f3f8b833f3f9ebe82e83f7aea873f3f8b833f3f9ebe82e83f7a42
EUC-JP 鼇??泣??楡り?z鼇??泣??楡り?zB 1111001111100111001111110011111110110101111000110011111100111111110111001100000010100100111010100011111101111010111100111110011100111111001111111011010111100011001111110011111111011100110000001010010011101010001111110111101001000010 f3e73f3fb5e33f3fdcc0a4ea3f7af3e73f3fb5e33f3fdcc0a4ea3f7a42
UTF-8 鼇앸뜉泣닷뎄楡り텞z鼇앸뜉泣닷뎄楡り텞zB 111010011011110010000111111011001001010110111000111010111001110010001001111001101011001110100011111010111000101110110111111010111000111010000100111001101010010110100001111000111000001010001010111011011000010110011110011110101110100110111100100001111110110010010101101110001110101110011100100010011110011010110011101000111110101110001011101101111110101110001110100001001110011010100101101000011110001110000010100010101110110110000101100111100111101001000010 e9bc87ec95b8eb9c89e6b3a3eb8bb7eb8e84e6a5a1e3828aed859e7ae9bc87ec95b8eb9c89e6b3a3eb8bb7eb8e84e6a5a1e3828aed859e7a42
UHC 鼇앸뜉泣닷뎄楡り텞z鼇앸뜉泣닷뎄楡り텞zB 111010001010100010011101111010111000110110001100111010111110100010110100111001011011010110101100111010101111100010101010111010101011011010010101011110101110100010101000100111011110101110001101100011001110101111101000101101001110010110110101101011001110101011111000101010101110101010110110100101010111101001000010 e8a89deb8d8cebe8b4e5b5aceaf8aaeab6957ae8a89deb8d8cebe8b4e5b5aceaf8aaeab6957a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)