What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????v???????vB 0011111100111111001111110011111100111111001111110011111101110110001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f763f3f3f3f3f3f3f7642
SJIS-WIN ??邯???丈v??邯???丈vB 001111110011111111100111101101100011111100111111001111111000111111100100011101100011111100111111111001111011011000111111001111110011111110001111111001000111011001000010 3f3fe7b63f3f3f8fe4763f3fe7b63f3f3f8fe47642
EUC-JP ??邯???丈v??邯???丈vB 001111110011111111101110101110000011111100111111001111111011111011100110011101100011111100111111111011101011100000111111001111110011111110111110111001100111011001000010 3f3feeb83f3f3fbee6763f3feeb83f3f3fbee67642
UTF-8 룶깻邯룶절룫丈v룶깻邯룶절룫丈vB 111010111010001110110110111010101011100110111011111010011000001010101111111010111010001110110110111011001010000010001000111010111010001110101011111001001011100010001000011101101110101110100011101101101110101010111001101110111110100110000010101011111110101110100011101101101110110010100000100010001110101110100011101010111110010010111000100010000111011001000010 eba3b6eab9bbe982afeba3b6eca088eba3abe4b88876eba3b6eab9bbe982afeba3b6eca088eba3abe4b8887642
UHC 룶깻邯룶절룫丈v룶깻邯룶절룫丈vB 10001111101010111011001010100010110010101111101110001111101010111100000011111101100011111010001011101101110110110111011010001111101010111011001010100010110010101111101110001111101010111100000011111101100011111010001011101101110110110111011001000010 8fabb2a2cafb8fabc0fd8fa2eddb768fabb2a2cafb8fabc0fd8fa2eddb7642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
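
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's actual implementation, which is not shown. The helper name `encode_bits` and the `CHARSETS` mapping are hypothetical, and mapping "UHC" to Python's `cp949` codec is an assumption. Unencodable characters are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 row reflects.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# SJIS-WIN -> cp932 follows the note above; UHC -> cp949 is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Sample input taken from the tail of the table's Character column.
    sample = "丈vB"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each byte always contributes exactly eight binary digits and two hexadecimal digits, so the binary column is four times as long as the hexadecimal one.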