To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????G 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f47
SJIS-WIN ???曖?????鵝????????釗??G 00111111001111110011111110011110010000100011111100111111001111110011111100111111111010100100000000111111001111110011111100111111001111110011111100111111001111111111101110111011001111110011111101000111 3f3f3f9e423f3f3f3f3fea403f3f3f3f3f3f3f3ffbbb3f3f47
EUC-JP ???曖?????鵝??嫄?????釗??G 00111111001111110011111111011011101000110011111100111111001111110011111100111111111100111010000100111111001111111000111110111010101000010011111100111111001111110011111100111111100011111110001110100110001111110011111101000111 3f3f3fdba33f3f3f3f3ff3a13f3f8fbaa13f3f3f3f3f8fe3a63f3f47
UTF-8 溜삘뵗曖쒋뵗嶪썩뵗鵝욋뵗嫄겸뵗溜잙졎釗숇젻G 11101111101001111000101111101100100000101001100011101011101101011001011111100110100110111001011011101100100100101000101111101011101101011001011111100101101101101010101011101100100011011010100111101011101101011001011111101001101101011001110111101100100110101000101111101011101101011001011111100101101010111000010011101010101100101011100011101011101101011001011111101111101001111000101111101100100111101001100111101100101000011000111011101001100001111001011111101100100010001000011111101100101000001011101101000111 efa78bec8298ebb597e69b96ec928bebb597e5b6aaec8da9ebb597e9b59dec9a8bebb597e5ab84eab2b8ebb597efa78bec9e99eca18ee98797ec8887eca0bb47
UHC 溜삘뵗曖쒋뵗嶪썩뵗鵝욋뵗嫄겸뵗溜잙졎釗숇젻G 11101010111111101011101111100010100101001001100111100100111100101001110011100010100101001001100111100101111101011011110111100010100101001001100111100100101111011011111111100010100101001001100111101010101100011011000011100010100101001001100111101010111111101001111111101011101000001011101111100001111100101001100111101011101000001010111001000111 eafebbe29499e4f29ce29499e5f5bde29499e4bdbfe29499eab1b0e29499eafe9feba0bbe1f299eba0ae47

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)