To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 證??第臀??苡?v證??第臀??苡?vB 1110011010011010001111110011111110010001111001101110010001011100001111110011111111100100100011110011111101110110111001101001101000111111001111111001000111100110111001000101110000111111001111111110010010001111001111110111011001000010 e69a3f3f91e6e45c3f3fe48f3f76e69a3f3f91e6e45c3f3fe48f3f7642
EUC-JP 證??第臀??苡?v證??第臀??苡?vB 1110101111111010001111110011111111000010111010001110011110111101001111110011111111100111111011110011111101110110111010111111101000111111001111111100001011101000111001111011110100111111001111111110011111101111001111110111011001000010 ebfa3f3fc2e8e7bd3f3fe7ef3f76ebfa3f3fc2e8e7bd3f3fe7ef3f7642
UTF-8 證불렚第臀렑렣苡햇v證불렚第臀렑렣苡햇vB 111010001010110110001001111010111011011010001000111010111010000010011010111001111010110010101100111010001000011110000000111010111010000010010001111010111010000010100011111010001000101110100001111011011001011010000111011101101110100010101101100010011110101110110110100010001110101110100000100110101110011110101100101011001110100010000111100000001110101110100000100100011110101110100000101000111110100010001011101000011110110110010110100001110111011001000010 e8ad89ebb688eba09ae7acace88780eba091eba0a3e88ba1ed968776e8ad89ebb688eba09ae7acace88780eba091eba0a3e88ba1ed96877642
UHC 證불렚第臀렑렣苡햇v證불렚第臀렑렣苡햇vB 111100011111101110111010110100101000111010101101111100001010111111010100111010111000111010100110100011101011010011101100101111101100011111011110011101101111000111111011101110101101001010001110101011011111000010101111110101001110101110001110101001101000111010110100111011001011111011000111110111100111011001000010 f1fbbad28eadf0afd4eb8ea68eb4ecbec7de76f1fbbad28eadf0afd4eb8ea68eb4ecbec7de7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)