What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 永??鎰??袁ы?v永??鎰??袁ы?vB 1000100101101001001111110011111111101000010011000011111100111111111001011100110110000100100011010011111101110110100010010110100100111111001111111110100001001100001111110011111111100101110011011000010010001101001111110111011001000010 89693f3fe84c3f3fe5cd848d3f7689693f3fe84c3f3fe5cd848d3f7642
EUC-JP 永??鎰??袁ы?v永??鎰??袁ы?vB 1011000111001010001111110011111111101111101011010011111100111111111010101100111110100111111011010011111101110110101100011100101000111111001111111110111110101101001111110011111111101010110011111010011111101101001111110111011001000010 b1ca3f3fefad3f3feacfa7ed3f76b1ca3f3fefad3f3feacfa7ed3f7642
UTF-8 永띕쪋鎰딁독袁ы뭿v永띕쪋鎰딁독袁ы뭿vB 11100110101100001011100011101011100111011001010111101100101010101000101111101001100011101011000011101011100101001000000111101011100011111000010111101000101000101000000111010001100010111110101110101101101111110111011011100110101100001011100011101011100111011001010111101100101010101000101111101001100011101011000011101011100101001000000111101011100011111000010111101000101000101000000111010001100010111110101110101101101111110111011001000010 e6b0b8eb9d95ecaa8be98eb0eb9481eb8f85e8a281d18bebadbf76e6b0b8eb9d95ecaa8be98eb0eb9481eb8f85e8a281d18bebadbf7642
UHC 永띕쪋鎰딁독袁ы뭿v永띕쪋鎰딁독袁ы뭿vB 111001111011010110110110111010111010010110000101111011001111000010001010111001111011010110110110111010101011111010101100111011011001001010001110011101101110011110110101101101101110101110100101100001011110110011110000100010101110011110110101101101101110101010111110101011001110110110010010100011100111011001000010 e7b5b6eba585ecf08ae7b5b6eabeaced928e76e7b5b6eba585ecf08ae7b5b6eabeaced928e7642

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
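
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `bit_strings` and the `CODECS` mapping are hypothetical, and the Python codecs `cp932` and `cp949` are assumed stand-ins for SJIS-WIN and UHC, so a few edge-case characters may map differently. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the `3f` bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as the closest Python codec
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Characters that cannot be encoded are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あB"  # hypothetical input
    for label, codec in CODECS.items():
        binary, hexadecimal = bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each byte is printed as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.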